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Preface 


isk management practices and the regulation of financial institutions have con- 

tinued to evolve in the past three years. Risk Management and Financial Institu- 

tions has been expanded and updated to reflect this. Like my other popular text 
Options, Futures, and Other Derivatives, the book is designed to be suitable for practicing 
managers as well as university students. Those studying for FMA and PRM qualifications 
will also find the book useful. 

The book is appropriate for university courses in either risk management or financial 
institutions. It is not necessary for students to take a course on options and futures markets 
prior to taking a course based on this book. But if they have taken such a course, some 
of the material in the first nine chapters does not need to be covered. 

The level of mathematical sophistication and the way material is presented have been 
managed carefully so that the book 1s accessible to as wide an audience as possible. For 
example, when covering copulas in Chapter 11, I present the intuition followed by a 
detailed numerical example; when covering maximum likelihood methods in Chapter 
10 and extreme value theory in Chapter 13, I provide numerical examples and enough 
details for readers to develop their own spreadsheets. I have also provided Excel spread- 
sheets for many applications on my website: 


www-2.rotman.utoronto.ca/~hull 
This is a book about risk management, so there is very little material on the valuation 
of derivatives. (That is the main focus of my other two books, Options, Futures, and Other 


Derivatives and Fundamentals of Futures and Options Markets.) The appendices at the end of 
the book include material that summarizes valuation and other results that are important 
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in risk management. The RMFI Software (version 1.00) is designed for this book and 
can be downloaded from my website. 


New Material 


The fifth edition has been fully updated and contains much new material. In particular: 


1. A new chapter on financial innovation has been included (Chapter 28). 

2. A new chapter on the regulation of OTC derivatives markets has been included 
(Chapter 17). This covers both cleared and uncleared transactions and explains the 
Standard Initial Margin Model (SIMM). 

3. The chapter on the Fundamental Review of the Trading Book (FRTB) has been 
rewritten to provide a fuller description and reflect recent changes (Chapter 18). 

4. The chapter on the model-building approach to estimating value at risk and expected 
shortfall (Chapter 14) has been rewritten to better reflect the way the market handles 
interest rates and the way the model-building approach is used for SIMM and FRTB. 

5. The chapter on operational risk (Chapter 23) has been rewritten to reflect regulatory 
developments in this area. 

6. The chapter on model risk management (Chapter 25) has been rewritten to cover 
more than just valuation models and to reflect regulatory requirements such as SR 
11-7. 

7. At various points in the book, recent developments such as IFRS 9 and SA-CCR 
are covered. 


Slides 


Several hundred PowerPoint slides can be downloaded from my website or from the 
Wiley Higher Education website. Adopting instructors are welcome to adapt the slides 
to meet their own needs. 


Questions and Problems 


End-of-chapter problems are divided into two groups: “Practice Questions and Prob- 
lems” and “Further Questions.” Solutions to the former are at the end of the book. 
Solutions to the latter and accompanying worksheets are available to adopting instruc- 
tors from the Wiley Higher Education website. 


Instructor Material 


The instructor’s manual is made available to adopting instructors on the Wiley Higher 
Education website. It contains solutions to “Further Questions” (with Excel worksheets), 
notes on the teaching of each chapter, and some suggestions on course organization. 
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Chapter 1 


Introduction 


magine you are the Chief Risk Officer (CRO) of a major corporation. The Chief 
Í Executive Officer (CEO) wants your views on a major new venture. You have been 

inundated with reports showing that the new venture has a positive net present value 
and will enhance shareholder value. What sort of analysis and ideas is the CEO looking 
for from you? 

As CRO it is your job to consider how the new venture fits into the company’s 
portfolio. What is the correlation of the performance of the new venture with the rest 
of the company’s business? When the rest of the business is experiencing difficulties, will 
the new venture also provide poor returns, or will it have the effect of dampening the 
ups and downs in the rest of the business? 

Companies must take risks if they are to survive and prosper. The risk management 
function’s primary responsibility is to understand the portfolio of risks that the company 
is currently taking and the risks it plans to take in the future. It must decide whether the 
risks are acceptable and, if they are not acceptable, what action should be taken. 

Most of this book is concerned with the ways risks are managed by banks and 
other financial institutions, but many of the ideas and approaches we will discuss are 
equally applicable to nonfinancial corporations. Risk management has become progres- 
sively more important for all corporations in the last few decades. Financial institutions 
in particular are finding they have to increase the resources they devote to risk man- 
agement. Large “rogue trader” losses such as those at Barings Bank in 1995, Allied Irish 
Bank in 2002, Société Générale in 2007, and UBS in 2011 would have been avoided 
if procedures used by the banks for collecting data on trading positions had been more 
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Table 1.1 One-Year Return from Investing 
$100,000 in a Stock 


Probability Return 
0.05 +50% 
0.25 +30% 
0.40 +10% 
0.25 —10% 
0.05 —30% 


carefully developed. Huge subprime losses at banks such as Citigroup, UBS, and Merrill 
Lynch would have been less severe if risk management groups had been able to convince 
senior management that unacceptable risks were being taken. 

This chapter sets the scene. It starts by reviewing the classical arguments concerning 
the risk-return trade-offs faced by an investor who is choosing a portfolio of stocks and 
bonds. It then considers whether the same arguments can be used by a company in 
choosing new projects and managing its risk exposure. The chapter concludes that there 
are reasons why companies—particularly financial institutions—should be concerned 
with the total risk they face, not just with the risk from the viewpoint ofa well-diversified 
shareholder. 


1.1 Risk vs. Return for Investors 


As all fund managers know, there is a trade-off between risk and return when money is 
invested. The greater the risks taken, the higher the return that can be realized. The trade- 
off is actually between risk and expected return, not between risk and actual return. The 
term “expected return” sometimes causes confusion. In everyday language an outcome 
that is “expected” is considered highly likely to occur. However, statisticians define the 
expected value of a variable as its average (or mean) value. Expected return is therefore a 
weighted average of the possible returns, where the weight applied to a particular return 
equals the probability of that return occurring. The possible returns and their probabilities 
can be either estimated from historical data or assessed subjectively. 

Suppose, for example, that you have $100,000 to invest for one year. Suppose further 
that Treasury bills yield 5%.' One alternative is to buy Treasury bills. There is then no 
risk and the expected return is 5%. Another alternative is to invest the $100,000 in a 
stock. To simplify things, we suppose that the possible outcomes from this investment are 
as shown in Table 1.1. There is a 0.05 probability that the return will be +50%; there 


' This is close to the historical average, but quite a bit higher than the Treasury yields seen in the years 
following 2008 in many countries. 
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is a 0.25 probability that the return will be +30%; and so on. Expressing the returns in 
decimal form, the expected return per year is: 


0.05 x 0.50 + 0.25 X 0.30 + 0.40 x 0.10 + 0.25 x (—0.10) + 0.05 x (—0.30) = 0.10 


This shows that, in return for taking some risk, you are able to increase your expected 
return per annum from the 5% offered by Treasury bills to 10%. If things work out well, 
your return per annum could be as high as 50%. But the worst-case outcome is a —30% 
return or a loss of $30,000. 

One of the first attempts to understand the trade-off between risk and expected 
return was by Markowitz (1952). Later, Sharpe (1964) and others carried the Markowitz 
analysis a stage further by developing what is known as the capital asset pricing model. 
This is a relationship between expected return and what is termed “systematic risk.” In 
1976, Ross developed arbitrage pricing theory, which can be viewed as an extension 
of the capital asset pricing model to the situation where there are several sources of 
systematic risk. The key insights of these researchers have had a profound effect on the 
way portfolio managers think about and analyze the risk-return trade-offs they face. In 
this section we review these insights. 


1.1.1 Quantifying Risk 


How do you quantify the risk you are taking when you choose an investment? A con- 
venient measure that is often used is the standard deviation of the return over one year. 
This is 


E(R?) — [E(R)]? 


where R is the return per annum. The symbol E denotes expected value so that E(R) is 
the expected return per annum. In Table 1.1,as we have shown, E(R) = 0.10. To calculate 
E(R?) we must weight the alternative squared returns by their probabilities: 


E(R?) = 0.05 x 0.50° + 0.25 x 0.307 + 0.40 x 0.107 + 0.25 x (—0.10)? 
+ 0.05 x (—0.30)? = 0.046 


The standard deviation of the annual return is therefore 10.046 — 0.12 = 0.1897 or 
18.97%. 


1.1.2 Investment Opportunities 


Suppose we choose to characterize every investment opportunity by its expected return 
and standard deviation of return. We can plot available risky investments on a chart such 
as Figure 1.1, where the horizontal axis is the standard deviation of the return and the 
vertical axis is the expected return. 
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Expected 
return 


Standard deviation of return 


Figure 1.1 Alternative Risky Investments 


Once we have identified the expected return and the standard deviation of the return 
for individual investments, it is natural to think about what happens when we combine 
investments to form a portfolio. Consider two investments with returns R, and R,. The 
return from putting a proportion w, of our money in the first investment and a propor- 
tion w = 1 — w in the second investment is 


mR + wR, 
The portfolio expected return is 


Hp = Wy + wp (1.1) 


where p is the expected return from the first investment and p, is the expected return 
from the second investment. The standard deviation of the portfolio return is given by 


Op = 1/ w07 + w304 + 2pw w010, (1.2) 


where 6, and ©, are the standard deviations of R, and R, and p is the coefficient of 
correlation between the two. 

Suppose that p; is 10% per annum and ©} is 16% per annum, while p, is 15% per 
annum and ©, is 24% per annum. Suppose also that the coefficient of correlation, p, 
between the returns is 0.2 or 20%. Table 1.2 shows the values of [1p and 6p for a number 
of different values of w, and w,. The calculations show that by putting part of your 
money in the first investment and part in the second investment a wide range of risk- 
return combinations can be achieved. These are plotted in Figure 1.2. 
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Table 1.2 Expected Return, p, and Standard Deviation of Return, 
Op, from a Portfolio Consisting of Two Investments 


Wy wy Hp Sp 

0.0 1.0 15% 24.00% 
0.2 0.8 14% 20.09% 
0.4 0.6 13% 16.89% 
0.6 0.4 12% 14.87% 
0.8 0.2 11% 14.54% 
1.0 0.0 10% 16.00% 


The expected returns from the investments are 10% and 15%; the standard deviations of 
the returns are 16% and 24%; and the correlation between returns is 0.2. 


Most investors are risk-averse. They want to increase expected return while reduc- 
ing the standard deviation of return. This means that they want to move as far as they 
can in a “northwest” direction in Figures 1.1 and 1.2. Figure 1.2 shows that forming 
a portfolio of the two investments we have been considering helps them do this. For 
example, by putting 60% in the first investment and 40% in the second, a portfolio 
with an expected return of 12% and a standard deviation of return equal to 14.87% is 
obtained. This is an improvement over the risk-return trade-off for the first investment. 
(The expected return is 2% higher and the standard deviation of the return is 1.13% 
lower.) 


| Expected 
return (%) 


Standard deviation of return (%) 
0 T T T T T 1 
0 5 10 15 20 25 30 


Figure 1.2 Alternative Risk-Return Combinations from Two Investments (as Calculated in 
Table 1.2) 
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1.2 The Efficient Frontier 


Let us now bring a third investment into our analysis. The third investment can be com- 
bined with any combination of the first two investments to produce new risk-return 
combinations. This enables us to move further in the northwest direction. We can then 
add a fourth investment. This can be combined with any combination of the first three 
investments to produce yet more investment opportunities. As we continue this process, 
considering every possible portfolio of the available risky investments, we obtain what is 
known as an efficient frontier. This represents the limit of how far we can move in a north- 
west direction and is illustrated in Figure 1.3. There is no investment that dominates a 
point on the efficient frontier in the sense that it has both a higher expected return and 
a lower standard deviation of return. The area southeast of the efficient frontier repre- 
sents the set of all investments that are possible. For any point in this area that is not on 
the efficient frontier, there is a point on the efficient frontier that has a higher expected 
return and lower standard deviation of return. 

In Figure 1.3 we have considered only risky investments. What does the efficient 
frontier of all possible investments look like? Specifically, what happens when we include 
the risk-free investment? Suppose that the risk-free investment yields a return of Rp. In 
Figure 1.4 we have denoted the risk-free investment by point F and drawn a tangent from 
point F to the efficient frontier of risky investments that was developed in Figure 1.3. M 
is the point of tangency. As we will now show, the line FJ is our new efficient frontier. 

Consider what happens when we form an investment I by putting Py of the funds 
we have available for investment in the risky portfolio, M, and 1 — fp in the risk-free 
investment F (0 < P; < 1). From equation (1.1) the expected return from the investment, 
E(R;), is given by 


E(R;) = (1 - B)RE + PE(Ry) 


Expected 
return 


Efficient frontier 


Individual risky 
© investments 


Standard deviation of return 


Figure 1.3 Efficient Frontier Obtainable from Risky Investments 
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Previous efficient 
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Figure 1.4 The Efficient Frontier of All Investments 

Point I is achieved by investing a percentage p of available funds in portfolio M and the rest in a 
risk-free investment. Point J is achieved by borrowing f} — 1 of available funds at the risk-free rate 
and investing everything in portfolio M. 


and from equation (1.2), because the risk-free investment has zero standard deviation, the 
return R; has standard deviation 


Bion 


where 6), is the standard deviation of return for portfolio M. This risk-return com- 
bination corresponds to the point labeled I in Figure 1.4. From the perspective of 
both expected return and standard deviation of return, point I is p; of the way from 
F to M. 

All points on the line FM can be obtained by choosing a suitable combination of 
the investment represented by point F and the investment represented by point M. The 
points on this line dominate all the points on the previous efficient frontier because they 
give a better risk-return combination. The straight line FM is therefore part of the new 
efficient frontier. 

If we make the simplifying assumption that we can borrow at the risk-free rate of 
Rp as well as invest at that rate, we can create investments that are on the continuation of 
FM beyond M. Suppose, for example, that we want to create the investment represented 
by the point J in Figure 1.4 where the distance of J from F is P; times the distance of M 
from F (P; > 1). We borrow P; — 1 of the amount that we have available for investment 
at rate Rp and then invest everything (the original funds and the borrowed funds) in 


8 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


the investment represented by point M. After allowing for the interest paid, the new 
investment has an expected return, E(Rj), given by 


E(R;) = PER) = (B; —1)Rp = (1 - Bp) Rp + PER) 


and the standard deviation of the return is 


Piom 


This shows that the risk and expected return combination corresponds to point J. (Note 
that the formulas for the expected return and standard deviation of return in terms of 
beta are the same whether beta is greater than or less than 1.) 

The argument that we have presented shows that, when the risk-free investment is 
considered, the efficient frontier must be a straight line. To put this another way, there 
should be a linear trade-off between the expected return and the standard deviation of 
returns, as indicated in Figure 1.4. All investors should choose the same portfolio of risky 
assets. This is the portfolio represented by M. They should then reflect their appetite 
for risk by combining this risky investment with borrowing or lending at the risk-free 
rate. 

It is a short step from here to argue that the portfolio of risky investments represented 
by M must be the portfolio of all risky investments. Suppose a particular investment is 
not in the portfolio. No investors would hold it and its price would have to go down 
so that its expected return increased and it became part of portfolio M. In fact, we 
can go further than this. To ensure a balance between the supply and demand for each 
investment, the price of each risky investment must adjust so that the amount of that 
investment in portfolio M is proportional to the amount of that investment available in 
the economy. The investment represented by point M is therefore usually referred to as 
the market portfolio. 


1.3 The Capital Asset Pricing Model 


How do investors decide on the expected returns they require for individual investments? 
Based on the analysis we have presented, the market portfolio should play a key role. 
The expected return required on an investment should reflect the extent to which the 
investment contributes to the risks of the market portfolio. 

A common procedure is to use historical data and regression analysis to determine 
a best-fit linear relationship between returns from an investment and returns from the 
market portfolio. This relationship has the form: 


R=a+ PR, +e (1.3) 


where R is the return from the investment, Ry is the return from the market portfolio, 
a and ĝ are constants, and € is a random variable equal to the regression error. 


Introduction 9 


Equation (1.3) shows that there are two uncertain components to the risk in the 
investment’s return: 


1. A component BR,,, which is a multiple of the return from the market portfolio. 
2. A component €, which is unrelated to the return from the market portfolio. 


The first component is referred to as systematic risk. The second component is referred 
to as nonsystematic risk. 

Consider first the nonsystematic risk. If we assume that the € variables for different 
investments are independent of each other, the nonsystematic risk is almost completely 
diversified away in a large portfolio. An investor should not therefore be concerned 
about nonsystematic risk and should not require an extra return above the risk-free rate 
for bearing nonsystematic risk. 

The systematic risk component is what should matter to an investor. When a large 
well-diversified portfolio is held, the systematic risk represented by BR, does not disap- 
pear. An investor should require an expected return to compensate for this systematic risk. 

We know how investors trade off systematic risk and expected return from Figure 
1.4. When B = 0 there is no systematic risk and the expected return is Rp. When p = 1, 
we have the same systematic risk as the market portfolio, which is represented by point 
M, and the expected return should be E(R,,). In general 


E(R) = Rp + B[E(Ry) — Rel (1.4) 
This is the capital asset pricing model. The excess expected return over the risk-free rate 


required on the investment is B times the excess expected return on the market portfolio. 
This relationship is plotted in Figure 1.5. The parameter P is the beta of the investment. 


\ 


Be eee rete ees ER) = Rr + BIE(Ry) ~ Rel 


Expected 
return, E(R) 


Beta, B 
1.0 


Figure 1.5 The Capital Asset Pricing Model 
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Example 1.1 


Suppose that the risk-free rate is 5% and the return on the market portfolio is 10%. An 
investment with a beta of 0 should have an expected return of 5%. This is because all 
of the risk in the investment can be diversified away. An investment with a beta of 0.5 
should have an expected return of 


0.05 + 0.5 x (0.1 — 0.05) = 0.075 
or 7.5%. An investment with a beta of 1.2 should have an expected return of 
0.05 + 1.2 x (0.1 — 0.05) = 0.11 


or 11%. 


The parameter B is equal to po/o,,, where p is the correlation between the return 
on the investment and the return on the market portfolio, o is the standard deviation 
of the return on the investment, and © is the standard deviation of the return on the 
market portfolio. Beta measures the sensitivity of the return on the investment to the 
return on the market portfolio. We can define the beta of any investment portfolio as in 
equation (1.3) by regressing its returns against the returns on the market portfolio. The 
capital asset pricing model in equation (1.4) should then apply with the return R defined 
as the return on the portfolio. In Figure 1.4 the market portfolio represented by M has a 
beta of 1.0 and the riskless portfolio represented by F has a beta of zero. The portfolios 
represented by the points J and J have betas equal to P; and p}, respectively. 


1.3.1 Assumptions 


The analysis we have presented leads to the surprising conclusion that all investors want 
to hold the same portfolios of assets (the portfolio represented by M in Figure 1.4). This 
is clearly not true. Indeed, if it were true, markets would not function at all well because 
investors would not want to trade with each other! In practice, different investors have 
different views on the attractiveness of stocks and other risky investment opportunities. 
This is what causes them to trade with each other and it is this trading that leads to the 
formation of prices in markets. 

The reason why the analysis leads to conclusions that do not correspond with the 
realities of markets is that, in presenting the arguments, we implicitly made a number of 
assumptions. In particular: 


1. We assumed that investors care only about the expected return and the standard 
deviation of return of their portfolio. Another way of saying this is that investors 
look only at the first two moments of the return distribution. If returns are normally 
distributed, it is reasonable for investors to do this. However, the returns from many 
assets are non-normal. They have skewness and excess kurtosis. Skewness is related to 
the third moment of the distribution and excess kurtosis is related to the fourth 
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moment. In the case of positive skewness, very high returns are more likely and 
very low returns are less likely than the normal distribution would predict; in the 
case of negative skewness, very low returns are more likely and very high returns 
are less likely than the normal distribution would predict. Excess kurtosis leads to 
a distribution where both very high and very low returns are more likely than the 
normal distribution would predict. Most investors are concerned about the possibility 
of extreme negative outcomes. They are likely to want a higher expected return from 
investments with negative skewness or excess kurtosis. 

2. We assumed that the € variables for different investments in equation (1.3) are inde- 
pendent. Equivalently we assumed the returns from investments are correlated with 
each other only because of their correlation with the market portfolio. This is clearly 
not true. Ford and General Motors are both in the automotive sector. There is likely 
to be some correlation between their returns that does not arise from their corre- 
lation with the overall stock market. This means that the € for Ford and the e€ for 
General Motors are not likely to be independent of each other. 

3. We assumed that investors focus on returns over just one period and the length of 
this period is the same for all investors. This is also clearly not true. Some investors 
such as pension funds have very long time horizons. Others such as day traders have 
very short time horizons. 

4. We assumed that investors can borrow and lend at the same risk-free rate. This is 
approximately true in normal market conditions for a large financial institution that 
has a good credit rating. But it is not exactly true for such a financial institution and 
not at all true for small investors. 

5. We did not consider tax. In some jurisdictions, capital gains are taxed differently from 
dividends and other sources of income. Some investments get special tax treatment 
and not all investors are subject to the same tax rate. In practice, tax considerations 
have a part to play in the decisions of an investor. An investment that is appropriate for 
a pension fund that pays no tax might be quite inappropriate for a high-marginal-rate 
taxpayer living in New York, and vice versa. 

6. Finally, we assumed that all investors make the same estimates of expected returns, 
standard deviations of returns, and correlations between returns for available invest- 
ments. To put this another way, we assumed that investors have homogeneous expecta- 
tions. This is clearly not true. Indeed, as mentioned earlier, if we lived in a world of 
homogeneous expectations there would be no trading. 


In spite of all this, the capital asset pricing model has proved to be a useful tool 
for portfolio managers. Estimates of the betas of stocks are readily available and the 
expected return on a portfolio estimated by the capital asset pricing model is a com- 
monly used benchmark for assessing the performance of the portfolio manager, as we 
will now explain. 


1.3.2 Alpha 


When we observe a return of R,, on the market, what do we expect the return on a 
M p 
portfolio with a beta of B to be? The capital asset pricing model relates the expected 
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Figure 1.6 Relationship between Expected Return on Portfolio and the Actual Return on the 
Market When Portfolio Beta Is 0.6 and Risk-Free Rate Is 4% 


return on a portfolio to the expected return on the market. But it can also be used to 
relate the expected return on a portfolio to the actual return on the market: 


E(Rp) = Re + P(Ru — Re) 
where Rp is the risk-free rate and Rp is the return on the portfolio. 


Example 1.2 


Consider a portfolio with a beta of 0.6 when the risk-free interest rate is 4%. When the 
return on the market is 20%, the expected return on the portfolio is 


0.04 + 0.6 X (0.2 — 0.04) = 0.136 


or 13.6%. When the return on the market is 10%, the expected return from the port- 
folio is 


0.04 + 0.6 X (0.1 — 0.04) = 0.076 


or 7.6%. When the return from the market is —10%, the expected return from the port- 
folio is 


0.04 + 0.6 x (—0.1 — 0.04) = —0.044 


or —4.4%. The relationship between the expected return on the portfolio and the return 
on the market is shown in Figure 1.6. 
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Suppose that the actual return on the portfolio is greater than the expected return: 
Rp > Rp + B(Ry — Rp) 


The portfolio manager has produced a superior return for the amount of systematic risk 
being taken. The extra return is 


a = Rp- Rp — B(Ry — Rp) 
This is commonly referred to as the alpha created by the portfolio manager? 


Example 1.3 


A portfolio manager has a portfolio with a beta of 0.8. The one-year risk-free rate of 
interest is 5%, the return on the market during the year is 7%, and the portfolio manager’s 
return is 9%. The manager’s alpha is 


a = 0.09 — 0.05 — 0.8 x (0.07 — 0.05) = 0.024 


or 2.4%. 


Portfolio managers are continually searching for ways of producing a positive alpha. 
One way is by trying to pick stocks that outperform the market. Another is by market 
timing. This involves trying to anticipate movements in the market as a whole and moving 
funds from safe investments such as Treasury bills to the stock market when an upturn is 
anticipated and in the other direction when a downturn is anticipated. Chapter 4 explains 
other strategies used by hedge funds to try to create positive alpha. 

Although the capital asset pricing model is unrealistic in some respects, the alpha and 
beta parameters that come out of the model are widely used to characterize investments. 
Beta describes the amount of systematic risk. The higher the value of beta, the greater 
the systematic risk being taken and the greater the extent to which returns are dependent 
on the performance of the market. Alpha represents the extra return made from superior 
portfolio management (or perhaps just good luck). An investor can make a positive alpha 
only at the expense of other investors who are making a negative alpha. The weighted 
average alpha of all investors must be zero. 


7 It is sometimes referred to as Jensen’s alpha because it was first used by Michael Jensen in evaluating 
mutual fund performance. See Section 4.3. 
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1.4 Arbitrage Pricing Theory 


Arbitrage pricing theory can be viewed as an extension of the capital asset pricing model. 
In the capital asset pricing model, an asset’s return depends on just one factor. In arbitrage 
pricing theory, the return depends on several factors. (These factors might involve vari- 
ables such as the gross national product, the domestic interest rate, and the inflation rate.) 
By exploring ways in which investors can form portfolios that eliminate exposure to the 
factors, arbitrage pricing theory shows that the expected return from an investment is 
linearly dependent on the factors. 

The assumption that the € variables for different investments are independent in 
equation (1.3) ensures that there is just one factor driving expected returns (and therefore 
one source of systematic risk) in the capital asset pricing model. This is the return on the 
market portfolio. In arbitrage pricing theory there are several factors affecting investment 
returns. Each factor is a separate source of systematic risk. Unsystematic (i.e., diversifiable) 
risk in arbitrage pricing theory is the risk that is unrelated to all the factors. 


1.5 Risk vs. Return for Companies 


We now move on to consider the trade-offs between risk and return made by a company. 
How should a company decide whether the expected return on a new investment project 
is sufficient compensation for its risks? 

The ultimate owners of a company are its shareholders and a company should be 
managed in the best interests of its shareholders. It is therefore natural to argue that a 
new project undertaken by the company should be viewed as an addition to its share- 
holders’ portfolio. The company should calculate the beta of the investment project and 
its expected return. If the expected return is greater than that given by the capital asset 
pricing model, it is a good deal for shareholders and the investment should be accepted. 
Otherwise it should be rejected. 

The argument just given suggests that nonsystematic risks should not be considered 
when accept/reject decisions on new projects are taken. In practice, companies are con- 
cerned about nonsystematic as well as systematic risks. For example, most companies 
insure themselves against the risk of their buildings burning down—even though this 
risk is entirely nonsystematic and can be diversified away by their shareholders. They 
try to avoid taking high risks and often hedge their exposures to exchange rates, interest 
rates, commodity prices, and other market variables. 

Earnings stability and the survival of the company are often important manage- 
rial objectives. Companies do try to ensure that their expected returns on new ven- 
tures are consistent with the risk-return trade-offs of their shareholders. But there is 
an overriding constraint that the total risks taken should not be allowed to get too 
large. 

Many investors are also concerned about the overall risk of the companies they invest 
in. They do not like surprises and prefer to invest in companies that show solid growth 
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and meet earnings forecasts. They like companies to manage risks carefully and limit the 
overall amount of risk—both systematic and nonsystematic—they are taking. 

The theoretical arguments we presented in Sections 1.1 to 1.4 suggest that investors 
should not behave in this way. They should hold a well-diversified portfolio and encour- 
age the companies they invest in to make high-risk investments when the combination 
of expected return and systematic risk is favorable. Some of the companies in a share- 
holder’s portfolio will go bankrupt, but others will do very well. The result should be an 
overall return to the shareholder that is satisfactory. 

Are investors behaving suboptimally? Would their interests be better served if com- 
panies took more nonsystematic risks? There is an important argument to suggest that 
this is not necessarily the case. This argument is usually referred to as the “bankruptcy 
costs” argument. It is often used to explain why a company should restrict the amount 
of debt it takes on, but it can be extended to apply to a wider range of risk management 
decisions than this. 


1.5.1 Bankruptcy Costs 


In a perfect world, bankruptcy would be a fast affair where the company’s assets (tangible 
and intangible) are sold at their fair market value and the proceeds are distributed to 
the company’s creditors using well-defined rules. If we lived in such a perfect world, the 
bankruptcy process itself would not destroy value for stakeholders. Unfortunately, the 
real world is far from perfect. By the time a company reaches the point of bankruptcy, 
it is likely that its assets have lost some value. The bankruptcy process itself invariably 
reduces the value of its assets further. This further reduction in value is referred to as 
bankruptcy costs. 

What is the nature of bankruptcy costs? Once a bankruptcy has happened, customers 
and suppliers become less inclined to do business with the company; assets sometimes 
have to be sold quickly at prices well below those that would be realized in an orderly 
sale; the value of important intangible assets, such as the company’s brand name and its 
reputation in the market, are often destroyed; the company is no longer run in the best 
interests of shareholders; large fees are often paid to accountants and lawyers; and so on. 
The story in Business Snapshot 1.1 is representative of what often happens in practice. 
It illustrates how, when a high-risk decision works out badly, there can be disastrous 
bankruptcy costs. 

The largest bankruptcy in U.S. history was that of Lehman Brothers on September 
15, 2008. Two years later, on September 14, 2010, the Financial Times reported that the 
legal and accounting fees in the United States and Europe relating to the bankruptcy of 
all the subsidiaries of the Lehman holding company had reached almost $2 billion, even 
though some of the services had been provided at discounted rates. Arguments between 
Lehman and its creditors continued well beyond 2010, and the costs of the bankruptcy 
soared even higher. 

We mentioned earlier that corporate survival is an important managerial objective 
and that shareholders like companies to avoid excessive risks. We now understand one 
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BUSINESS SNAPSHOT 1.1 
The Hidden Costs of Bankruptcy 


Several years ago, a company had a market capitalization of $2 billion and 
$500 million of debt. The CEO decided to acquire a company in a related 
industry for $1 billion in cash. The cash was raised using a mixture of bank 
debt and bond issues. The price paid for the company was justified on the basis 
of potential synergies, but key threats to the profitability of the company were 
overlooked. 

Many of the anticipated synergies were not realized. Furthermore, the com- 
pany that was acquired was not profitable and proved to be a cash drain on 
the parent company. After three years, the CEO resigned. The new CEO sold 
the acquisition for $100 million (10% of the price paid) and announced that the 
company would focus on its original core business. However, by then the com- 
pany was highly leveraged. A temporary economic downturn made it impossible 
for the company to service its debt and it declared bankruptcy. 

The offices of the company were soon filled with accountants and lawyers 
representing the interests of the various parties (banks, different categories of 
bondholders, equity holders, the company, and the board of directors). These 
people directly or indirectly billed the company about $10 million per month in 
fees. The company lost sales that it would normally have made because nobody 
wants to do business with a bankrupt company. Key senior executives left. The 
company experienced a dramatic reduction in its market share. 

After two years and three reorganization attempts, an agreement was reached 
among the various parties, and a new company with a market capitalization of 
$700,000 was incorporated to continue the remaining profitable parts of the 
business. The shares in the new company were entirely owned by the banks and 
the bondholders. The shareholders got nothing. 


reason why this is so. Bankruptcy laws vary widely from country to country, but they all 
have the effect of destroying value as lenders and other creditors vie with each other to get 
paid. Ifa company chooses projects with very high risks (but sufficiently high expected 
returns to be above the efficient frontier in Figure 1.4), the probability of bankruptcy 
will be quite high. When expected bankruptcy costs are taken into account, projects 
that have a high total (systematic plus nonsystematic) risk are liable to be rejected as 
unacceptable. This explains why investors like companies to limit the overall amount 
of risk they take and reward companies that manage risks so that they meet earnings 
forecasts. 
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When a major new investment is being contemplated, it is important to consider 
how well it fits in with other risks taken by the company. Relatively small investments 
can often have the effect of reducing the overall risks taken because of their diversifi- 
cation benefits. However, a large investment can dramatically increase these risks. Many 
spectacular corporate failures (such as the one in Business Snapshot 1.1) can be traced to 
CEOs who made large acquisitions (often highly leveraged) that did not work out. 


1.5.2 Financial Institutions 


One can argue about how important bankruptcy costs are for the decision making of a 
non-financial company, but there can be no question that it is crucially important for a 
financial institution such as a bank to keep its probability of bankruptcy very low. Large 
banks rely on wholesale deposits and instruments such as commercial paper for their 
funding. Confidence is the key to their survival. If the risk of default is perceived by 
the market to be other than very low, there will be a lack of confidence and sources of 
funding will dry up. The bank will be then be forced into liquidation—even if it is solvent 
in the sense of having positive equity. Lehman Brothers was the largest bankruptcy in 
USS. history. Northern Rock was a large failure of a financial institution in the United 
Kingdom. In both cases, the failure was because there was a lack of confidence and 
traditional sources of funding dried up. 


1.5.3 Regulation 


Even if, in spite of the arguments we have just given, the managers of a bank wanted 
to take huge risks, they would not be allowed to do so. Unlike other companies, many 
financial institutions are heavily regulated. Governments throughout the world want a 
stable financial sector. It is important that companies and private individuals have confi- 
dence in banks and insurance companies when they transact business. The regulations are 
designed to ensure that the probability of a large bank or an insurance company expe- 
riencing severe financial difficulties is low. The bailouts of financial institutions in 2008 
during the subprime crisis illustrate the reluctance of governments to let large financial 
institutions fail. Regulated financial institutions are forced to consider total risks (system- 
atic plus nonsystematic). 

Bankruptcy often arises from losses being incurred. Regulators try to ensure that the 
capital held by a bank is sufficient to provide a cushion to absorb the losses with a high 
probability. Suppose, for example, that there is considered to be only a 0.1% probability 
that a financial institution will experience a loss of $2 billion or more in a year. Regulators 
might require the bank to hold equity capital equal to $2 billion. This would ensure that 
there is a 99.9% probability that the equity capital is sufficient to absorb the losses. The 
models used by regulators are discussed in more detail in later chapters. 

The key point here is that regulators are concerned with total risks, not just systematic 
risks. Their goal is to make bankruptcy a highly unlikely event. 
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1.6 Risk Management by Financial Institutions 


There are two broad risk management strategies open to a financial institution (or any 
other organization). One approach is to identify risks one by one and handle each one 
separately. This is sometimes referred to as risk decomposition. The other is to reduce risks 
by being well diversified. This is sometimes referred to as risk aggregation. Both approaches 
are typically used by financial institutions. 

Consider, for example, the market risks incurred by the trading room of a bank. These 
risks depend on the future movements in a multitude of market variables (exchange rates, 
interest rates, stock prices, and so on). To implement the risk decomposition approach, 
the trading room is organized so that a trader is responsible for trades related to just 
one market variable (or perhaps a small group of market variables). For example, there 
could be one trader who is responsible for all trades involving the dollar-yen exchange 
rate. At the end of each day, the trader is required to ensure that certain risk measures 
are kept within limits specified by the bank. If the end of the day is approached and it 
looks as though one or more of the risk measures will be outside the specified limits, 
the trader must either get special permission to maintain the position or execute new 
hedging trades so that the limits are adhered to. (The risk measures and the way they are 
used are discussed in Chapter 8.) 

The risk managers, working in what is termed the middle office of a bank, implement 
the risk aggregation approach for the market risks being taken. This involves calculating 
at the end of each day the total risk faced by the bank from movements in all market 
variables. Hopefully, the bank is well diversified so that its overall exposure to market 
movements is fairly small. If risks are unacceptably high, then the reasons must be deter- 
mined and corrective action taken. The models used for the aggregation of market risks 
are given in Chapters 12, 13, and 14. 

Risk aggregation is a key tool for insurance companies. Consider automobile insur- 
ance. The insurance company’s payout on one particular automobile insurance policy 
is quite uncertain. However, the payout on 100,000 similar insurance policies can be 
predicted with reasonable accuracy. 

Credit risks are also traditionally managed using risk aggregation. It is important 
for a financial institution to be well diversified. If, for example, a bank lends 40% of its 
available funds to a single borrower, it is not well diversified and likely to be subject to 
unacceptable risks. If the borrower runs into financial difficulties and is unable to make 
interest and principal payments, the bank could become insolvent. 

If the bank adopts a more diversified strategy of lending 0.01% of its available funds 
to each of 10,000 different borrowers, it is in a much safer position. Suppose that the 
probability of any one borrower defaulting is 1%. We can expect that close to 100 bor- 
rowers will default in the year and the losses on these borrowers will be more than offset 
by the profits earned on the 99% of loans that perform well. To maximize the bene- 
fits of diversification, borrowers should be in different geographical regions and different 
industries. A large international bank with different types of borrowers all over the world 


Introduction 19 


is likely to be much better diversified than a small bank in Texas that lends entirely to oil 
companies. 

But, however well diversified a bank is, it is still exposed to systematic risk, which 
creates variations in the probability of default for all borrowers from year to year. Suppose 
that the probability of default for borrowers in an average year is 1%. When the economy 
is doing well, the probability of default is less than this and when there is an economic 
downturn it is liable to be considerably more than this. Models for capturing this exposure 
are discussed in later chapters. 

Since the late 1990s, we have seen the emergence of an active market for credit 
derivatives. Credit derivatives allow banks to handle credit risks one by one (risk decom- 
position) rather than relying solely on risk diversification. They also allow banks to buy 
protection against the overall level of defaults in the economy. However, for every buyer 
of credit protection there must be a seller. Many sellers of credit protection, whether on 
individual companies or on portfolios of companies, took huge losses during the credit 
crisis that started in 2007. The credit crisis is discussed further in Chapter 6. 


1.7 Credit Ratings 


Credit rating agencies provide information that is widely used by financial market partic- 
ipants for the management of credit risks. A credit rating is a measure of the credit quality 
of a debt instrument such as a bond. However, the rating of a corporate or sovereign bond 
is often assumed to be an attribute of the bond issuer rather than of the bond itself. Thus, 
if the bonds issued by a company have a rating of AAA, the company is often referred to 
as having a rating of AAA. 

The three major credit rating agencies are Moody’s, S&P, and Fitch. The best rating 
assigned by Moody’s is Aaa. Bonds with this rating are considered to have almost no 
chance of defaulting. The next best rating is Aa. Following that come A, Baa, Ba, B, Caa, 
Ca, and C. The S&P ratings corresponding to Moody’s Aaa, Aa, A, Baa, Ba, B, Caa, Ca, 
and C are AAA, AA, A, BBB, BB, B, CCC, CC, and C, respectively. To create finer rating 
measures Moody’s divides the Aa rating category into Aa1, Aa2, and Aa3; it divides A 
into A1, A2, and A3; and so on. Similarly S&P divides its AA rating category into AA+, 
AA, and AA--; it divides its A rating category into A+, A, and A—; and so on. Moody’s 
Aaa rating category and S&P’s AAA rating are not subdivided, nor usually are the two 
lowest rating categories. Fitch’s rating categories are similar to those of S&P. 

There is usually assumed to be an equivalence between the meanings of the ratings 
assigned by the different agencies. For example, a BBB+ rating from S&P is considered 
equivalent to a Baal rating from Moody’s. Instruments with ratings of BBB— (Baa3) or 
above are considered to be investment grade. Those with ratings below BBB— (Baa3) are 
termed noninvestment grade or speculative grade or junk bonds. (In August 2012, S&P created 
a stir by downgrading the debt of the U.S. government from AAA to AA+.) 
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We will learn a lot more about credit ratings in later chapters of this book. For exam- 
ple, Chapter 6 discusses the role of ratings in the credit crisis that started in 2007. Chapters 
15 and 16 provide information on how ratings are used in regulation. Chapter 19 pro- 
vides statistics on the default rates of companies with different credit ratings. Chapter 21 
examines the extent to which the credit ratings of companies change through time. 


Summary 


An important general principle in finance is that there is a trade-off between risk and 
return. Higher expected returns can usually be achieved only by taking higher risks. In 
theory, shareholders should not be concerned with risks they can diversify away. The 
expected return they require should reflect only the amount of systematic (i.e., non- 
diversifiable) risk they are bearing. 

Companies, although sensitive to the risk-return trade-offs of their shareholders, are 
concerned about total risks when they do risk management. They do not ignore the 
unsystematic risk that their shareholders can diversify away. One valid reason for this is 
the existence of bankruptcy costs, which are the costs to shareholders resulting from the 
bankruptcy process itself. 

For financial institutions such as banks and insurance companies there is another 
important reason: regulation. The regulators of financial institutions are primarily con- 
cerned with minimizing the probability that the institutions they regulate will fail. The 
probability of failure depends on the total risks being taken, not just the risks that cannot 
be diversified away by shareholders. As we will see later in this book, regulators aim to 
ensure that financial institutions keep enough capital for the total risks they are taking. 

Two general approaches to risk management are risk decomposition and risk aggre- 
gation. Risk decomposition involves managing risks one by one. Risk aggregation 
involves relying on the power of diversification to reduce risks. Banks use both approaches 
to manage market risks. Credit risks have traditionally been managed using risk aggre- 
gation, but with the advent of credit derivatives the risk decomposition approach can be 
used. 
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Practice Questions and Problems 
(Answers at End of Book) 


An investment has probabilities 0.1, 0.2, 0.35, 0.25, and 0.1 of giving returns equal 
to 40%, 30%, 15%, —5%, and —15%. What are the expected returns and the standard 
deviations of returns? 

Suppose that there are two investments with the same probability distribution of 
returns as in Problem 1.1. The correlation between the returns is 0.15. What is the 
expected return and standard deviation of return from a portfolio where money is 
divided equally between the investments? 

For the two investments considered in Figure 1.2 and Table 1.2, what are the alter- 
native risk-return combinations if the correlation is (a) 0.3, (b) 1.0, and (c) —1.0? 
What is the difference between systematic and nonsystematic risk? Which is more 
important to an equity investor? Which can lead to the bankruptcy of a corpora- 
tion? 

Outline the arguments leading to the conclusion that all investors should choose 
the same portfolio of risky investments. What are the key assumptions? 

The expected return on the market portfolio is 12% and the risk-free rate is 6%. 
What is the expected return on an investment with a beta of (a) 0.2, (b) 0.5, and 
(c) 1.4? 

“Arbitrage pricing theory is an extension of the capital asset pricing model.” Explain 
this statement. 

“The capital structure decision of a company is a trade-off between bankruptcy 
costs and the tax advantages of debt.” Explain this statement. 

What is meant by risk aggregation and risk decomposition? Which requires an in- 
depth understanding of individual risks? Which requires a detailed knowledge of 
the correlations between risks? 

A bank’s operational risk includes the risk of very large losses because of employee 
fraud, natural disasters, litigation, and so on. Do you think operational risk is best 
handled by risk decomposition or risk aggregation? (Operational risk will be dis- 
cussed in Chapter 23.) 

A bank’s profit next year will be normally distributed with a mean of 0.6% of assets 
and a standard deviation of 1.5% of assets. The bank’s equity is 4% of assets. What 
is the probability that the bank will have a positive equity at the end of the year? 
Ignore taxes. 

Why do you think that banks are regulated to ensure that they do not take too 
much risk but most other companies (for example, those in manufacturing and 
retailing) are not? 

List the bankruptcy costs incurred by the company in Business Snapshot 1.1. 

The return from the market last year was 10% and the risk-free rate was 5%. A 
hedge fund manager with a beta of 0.6 has an alpha of 4%. What return did the 
hedge fund manager earn? 
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1.15 


1.16 


1.17 


1.18 


Further Questions 


Suppose that one investment has a mean return of 8% and a standard deviation 
of return of 14%. Another investment has a mean return of 12% and a standard 
deviation of return of 20%. The correlation between the returns is 0.3. Produce a 
chart similar to Figure 1.2 showing alternative risk-return combinations from the 
two investments. 

The expected return on the market is 12% and the risk-free rate is 7%. The standard 
deviation of the return on the market is 15%. One investor creates a portfolio on 
the efficient frontier with an expected return of 10%. Another creates a portfolio 
on the efficient frontier with an expected return of 20%. What is the standard 
deviation of the return on each of the two portfolios? 

A bank estimates that its profit next year is normally distributed with a mean of 
0.8% of assets and the standard deviation of 2% of assets. How much equity (as a 
percentage of assets) does the company need to be (a) 99% sure that it will have a 
positive equity at the end of the year and (b) 99.9% sure that it will have positive 
equity at the end of the year? Ignore taxes. 

A portfolio manager has maintained an actively managed portfolio with a beta 
of 0.2. During the last year, the risk-free rate was 5% and major equity indices 
performed very badly, providing returns of about —30%. The portfolio manager 
produced a return of —10% and claims that in the circumstances it was good. Discuss 
this claim. 


Part One 


FINANCIAL INSTITUTIONS 
AND THEIR TRADING 


Chapter 2 


Banks 


he word “bank” originates from the Italian word banco. This is a desk or bench, 

covered by a green tablecloth, that was used several hundred years ago by Floren- 

tine bankers. The traditional role of banks has been to take deposits and make 
loans. The interest charged on the loans is greater than the interest paid on deposits. 
The difference between the two has to cover administrative costs and loan losses (i.e., 
losses when borrowers fail to make the agreed payments of interest and principal), while 
providing a satisfactory return on equity. 

Today, most large banks engage in both commercial and investment banking. Com- 
mercial banking involves, among other things, the deposit-taking and lending activities 
we have just mentioned. Investment banking is concerned with assisting companies in 
raising debt and equity, and providing advice on mergers and acquisitions, major cor- 
porate restructurings, and other corporate finance decisions. Large banks are also often 
involved in securities trading (e.g., by providing brokerage services). 

Commercial banking can be classified as retail banking or wholesale banking. Retail 
banking, as its name implies, involves taking relatively small deposits from private indi- 
viduals or small businesses and making relatively small loans to them. Wholesale banking 
involves the provision of banking services to medium and large corporate clients, fund 
managers, and other financial institutions. Both loans and deposits are much larger in 
wholesale banking than in retail banking. Sometimes banks fund their lending by bor- 
rowing in financial markets themselves. 

Typically the spread between the cost of funds and the lending rate is smaller for 
wholesale banking than for retail banking. However, this tends to be offset by lower 
costs. (When a certain dollar amount of wholesale lending is compared to the same 


25 


26 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


dollar amount of retail lending, the expected loan losses and administrative costs are 
usually much less.) Banks that are heavily involved in wholesale banking and may fund 
their lending by borrowing in financial markets are referred to as money center banks. 

This chapter will review how commercial and investment banking have evolved in 
the United States over the last hundred years. It will take a first look at the way the 
banks are regulated, the nature of the risks facing the banks, and the key role of capital 
in providing a cushion against losses. 


2.1 Commercial Banking 


Commercial banking in virtually all countries has been subject to a great deal of regu- 
lation. This is because most national governments consider it important that individuals 
and companies have confidence in the banking system. Among the issues addressed by 
regulation is the capital that banks must keep, the activities they are allowed to engage in, 
deposit insurance, and the extent to which mergers and foreign ownership are allowed. 
The nature of bank regulation during the twentieth century has influenced the structure 
of commercial banking in different countries. To illustrate this, we consider the case of 
the United States. 

The United States is unusual in that it has a large number of banks (5,060 in 2017). 
This leads to a relatively complicated payment system compared with those of other 
countries with fewer banks. There are a few large money center banks such as Citigroup 
and JPMorgan Chase. There are several hundred regional banks that engage in a mixture 
of wholesale and retail banking, and several thousand community banks that specialize 
in retail banking. 

Table 2.1 summarizes the size distribution of banks in the United States in 1984 and 
2017. The number of banks declined by over 65% between the two dates. In 2017, there 
were fewer small community banks and more large banks than in 1984. Although there 
were only 102 banks (2% of the total) with assets of $10 billion or more in 2017, they 
accounted for over 84% of the assets in the U.S. banking system. 

The structure of banking in the United States is largely a result of regulatory restric- 
tions on interstate banking. At the beginning of the twentieth century, most U.S. banks 
had a single branch from which they served customers. During the early part of the 
twentieth century, many of these banks expanded by opening more branches in order 
to serve their customers better. This ran into opposition from two quarters. First, small 
banks that still had only a single branch were concerned that they would lose market 
share. Second, large money center banks were concerned that the multibranch banks 
would be able to offer check-clearing and other payment services and erode the profits 
that they themselves made from offering these services. As a result, there was pressure to 
control the extent to which community banks could expand. Several states passed laws 
restricting the ability of banks to open more than one branch within a state. 

The McFadden Act was passed in 1927 and amended in 1933. This act had the effect 
of restricting all banks from opening branches in more than one state. This restriction 
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Table 2.1 Bank Concentration in the United States in 1984 and 2017 


1984 
Percent Assets Percent 
Size (Assets) Number of Total ($ billions) of Total 
Under $100 million 12,044 83.2 404.2 16.1 
$100 million to $1 billion 2,161 14.9 513.9 20.5 
$1 billion to $10 billion 254 17 725.9 28.9 
Over $10 billion 24 0.2 864.8 34.5 
Total 14,483 2,508.9 
2017 
Percent Assets Percent 
Size (Assets) Number of Total ($ billions) of Total 
Under $100 million 1,318 26.0 78.6 0.5 
$100 million to $1 billion 3,123 61.7 988.5 6.3 
$1 billion to $10 billion 517 10.2 1,441.5 9.1 
Over $10 billion 102 2.0 13,281.2 84.1 
Total 5,060 15,789.5 


Source: FDIC Quarterly Banking Profile, www.fdic.gov. 


applied to nationally chartered as well as to state-chartered banks. One way of getting 
around the McFadden Act was to establish a multibank holding company. This is a com- 
pany that acquires more than one bank as a subsidiary. By 1956, there were 47 multibank 
holding companies. This led to the Douglas Amendment to the Bank Holding Company 
Act. This did not allow a multibank holding company to acquire a bank in a state that 
prohibited out-of-state acquisitions. However, acquisitions prior to 1956 were grandfa- 
thered (that is, multibank holding companies did not have to dispose of acquisitions made 
prior to 1956). 

Banks are creative in finding ways around regulations—particularly when it is prof- 
itable for them to do so. After 1956, one approach was to form a one-bank holding 
company. This is a holding company with just one bank as a subsidiary and a number of 
nonbank subsidiaries in different states from the bank. The nonbank subsidiaries offered 
financial services such as consumer finance, data processing, and leasing and were able to 
create a presence for the bank in other states. 

The 1970 Bank Holding Companies Act restricted the activities of one-bank holding 
companies. They were only allowed to engage in activities that were closely related to 
banking, and acquisitions by them were subject to approval by the Federal Reserve. They 
had to divest themselves of acquisitions that did not conform to the act. 

After 1970, the interstate banking restrictions started to disappear. Individual states 
passed laws allowing banks from other states to enter and acquire local banks. (Maine was 
the first to do so, in 1978.) Some states allowed free entry of other banks. Some allowed 
banks from other states to enter only if there were reciprocal agreements. (This means 
that state A allowed banks from state B to enter only if state B allowed banks from state 
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Table 2.2 Summary Balance Sheet for DLC at End of 2018 ($ millions) 


Assets Liabilities and Net Worth 
Cash 5 Deposits 90 
Marketable Securities 10 Subordinated Long-Term Debt 5 
Loans 80 Equity Capital 5 
Fixed Assets 13 o 
Total 100 Total 100 


A to do so.) In some cases, groups of states developed regional banking pacts that allowed 
interstate banking. 

In 1994, the U.S. Congress passed the Riegel-Neal Interstate Banking and Branching 
Efficiency Act. This Act led to full interstate banking becoming a reality. It permitted 
bank holding companies to acquire branches in other states. It invalidated state laws that 
allowed interstate banking on a reciprocal or regional basis. Starting in 1997, bank holding 
companies were allowed to convert out-of-state subsidiary banks into branches ofa single 
bank. Many people argued that this type of consolidation was necessary to enable U.S. 
banks to be large enough to compete internationally. The Riegel-Neal Act prepared the 
way for a wave of consolidation in the U.S. banking system (for example, the acquisition 
by JPMorgan of banks formerly named Chemical, Chase, Bear Stearns, and Washington 
Mutual). 

As a result of the credit crisis that started in 2007 and led to a number of bank failures, 
the Dodd—Frank Wall Street Reform and Consumer Protection Act was signed into law 
by President Barack Obama on July 21, 2010. This is discussed further in Section 16.5. 


2.2 The Capital Requirements of a Small Commercial Bank 


To illustrate the role of capital in banking, we consider a hypothetical small community 
bank named Deposits and Loans Corporation (DLC). DLC is primarily engaged in the 
traditional banking activities of taking deposits and making loans. A summary balance 
sheet for DLC at the end of 2018 is shown in Table 2.2 and a summary income statement 
for 2018 is shown in Table 2.3. 

Table 2.2 shows that the bank has $100 million in assets. Most of the assets (80% of 
the total) are loans made by the bank to private individuals and small corporations. Cash 
and marketable securities account for a further 15% of the assets. The remaining 5% of 
the assets are fixed assets (i.e., buildings, equipment, etc.). A total of 90% of the funding for 


Table 2.3 Summary Income Statement for 
DLC in 2018 ($ millions) 


Net Interest Income 3.00 
Loan Losses (0.80) 
Non-Interest Income 0.90 
Non-Interest Expense (2.50) 


Pre-Tax Operating Income 0.60 
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the assets comes from deposits of one sort or another from the bank’s customers. A fur- 
ther 5% is financed by subordinated long-term debt. (These are bonds issued by the bank 
to investors that rank below deposits in the event of a liquidation.) The remaining 5% is 
financed by the bank’s shareholders in the form of equity capital. The equity capital con- 
sists of the original cash investment of the shareholders and earnings retained in the bank. 

Consider next the income statement for 2018 shown in Table 2.3. The first item on 
the income statement is net interest income. This is the excess of the interest earned over 
the interest paid and is 3% of the total assets in our example. It is important for the bank to 
be managed so that net interest income remains roughly constant regardless of movements 
in interest rates of different maturities. We will discuss this in more detail in Chapter 9. 

The next item is loan losses. This is 0.8% of total assets for the year in question. 
Clearly it is very important for management to quantify credit risks and manage them 
carefully. But however carefully a bank assesses the financial health of its clients before 
making a loan, it is inevitable that some borrowers will default. This is what leads to loan 
losses. The percentage of loans that default will tend to fluctuate from year to year with 
economic conditions. It is likely that in some years default rates will be quite low, while 
in others they will be quite high. 

The next item, non-interest income, consists of income from all the activities of the 
bank other than lending money. This includes fees for the services the bank provides for 
its clients. In the case of DLC non-interest income is 0.9% of assets. 

The final item is non-interest expense and is 2.5% of assets in our example. This 
consists of all expenses other than interest paid. It includes salaries, technology-related 
costs, and other overheads. As in the case of all large businesses, these have a tendency to 
increase over time unless they are managed carefully. Banks must try to avoid large losses 
from litigation, business disruption, employee fraud, and so on. The risk associated with 
these types of losses is known as operational risk and will be discussed in Chapter 23. 


2.2.1 Capital Adequacy 


One measure of the performance of a bank is return on equity (ROE). Tables 2.2 and 
2.3 show that DLC’s before-tax ROE is 0.6/5 or 12%. If this is considered unsatisfactory, 
one way DLC might consider improving its ROE is by buying back its shares and 
replacing them with deposits so that equity financing is lower and ROE is higher. For 
example, if it moved to the balance sheet in Table 2.4 where equity is reduced to 1% of 
assets and deposits are increased to 94% of assets, its before-tax ROE would jump to 60%. 

How much equity capital does DLC need? This question can be answered by hypoth- 
esizing an extremely adverse scenario and considering whether the bank would survive. 
Suppose that there is a severe recession and as a result the bank’s loan losses rise by 
3.2% of assets to 4% next year. (We assume that other items on the income statement in 
Table 2.3 are unaffected.) The result will be a pre-tax net operating loss of 2.6% of assets 
(0.6 — 3.2 = —2.6). Assuming a tax rate of 30%, this would result in an after-tax loss of 
about 1.8% of assets.! 


' This assumes that tax losses can be carried back to offset previous profits. 


30 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


Table 2.4 Alternative Balance Sheet for DLC at End of 2018 with Equity Only 1% of Assets 
($ millions) 


Assets Liabilities and Net Worth 
Cash 5 Deposits 94 
Marketable Securities 10 Subordinated Long-Term Debt 5 
Loans 80 Equity Capital 1 
Fixed Assets 5 a 
Total 100 Total 100 


In Table 2.2, equity capital is 5% of assets, so an after-tax loss equal to 1.8% of assets, 
although not at all welcome, can be absorbed. It would result in a reduction of the equity 
capital to 3.2% of assets. Even a second bad year similar to the first would not totally 
wipe out the equity. 

If DLC has moved to the more aggressive capital structure shown in Table 2.4, it is 
far less likely to survive. One year where the loan losses are 4% of assets would totally 
wipe out equity capital and the bank would find itself in serious financial difficulties. It 
would no doubt try to raise additional equity capital, but it is likely to find this difficult 
when in such a weak financial position. It is possible that there would be a run on the 
bank (where all depositors decide to withdraw funds at the same time) and the bank 
would be forced into liquidation. If all assets could be liquidated for book value (a big 
assumption), the long-term debt-holders would likely receive about $4.2 million rather 
than $5 million (they would in effect absorb the negative equity) and the depositors 
would be repaid in full. 

Clearly, it is inadequate for a bank to have only 1% of assets funded by equity capital. 
Maintaining equity capital equal to 5% of assets as in Table 2.2 is more reasonable. Note 
that equity and subordinated long-term debt are both sources of capital. Equity provides 
the best protection against adverse events. (In our example, when the bank has $5 million 
of equity capital rather than $1 million, it stays solvent and is unlikely to be liquidated.) 
Subordinated long-term debt-holders rank below depositors in the event of default, but 
subordinated debt does not provide as good a cushion for the bank as equity because it 
does not prevent the bank’s insolvency. 

As we shall see in Chapters 15 to 18, bank regulators have tried to ensure that the 
capital a bank keeps is sufficient to cover the risks it takes. The risks include market risks, 
credit risks, and operational risks. Equity capital is categorized as “Tier 1 capital” while 
subordinated long-term debt is categorized as “Tier 2 capital.” 


2.3 Deposit Insurance 
To maintain confidence in banks, government regulators in many countries have intro- 


duced guaranty programs. These typically insure depositors against losses up to a certain 
level. 
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The United States with its large number of small banks is particularly prone to 
bank failures. After the stock market crash of 1929 the United States experienced a 
major recession and about 10,000 banks failed between 1930 and 1933. Runs on banks 
and panics were common. In 1933, the United States government created the Federal 
Deposit Insurance Corporation (FDIC) to provide protection for depositors. Originally, 
the maximum level of protection provided was $2,500. This has been increased sev- 
eral times and became $250,000 per depositor per bank in October 2008. Banks pay an 
insurance premium that is a percentage of their domestic deposits. Since 2007, the size 
of the premium paid has depended on the bank’s capital and how safe it is considered 
to be by regulators. For well-capitalized banks, the premium might be less than 0.1% of 
the amount insured; for under-capitalized banks, it could be over 0.35% of the amount 
insured. 

Up to 1980, the system worked well. There were no runs on banks and few bank 
failures. However, between 1980 and 1990, bank failures in the United States accelerated, 
with the total number of failures during this decade being over 1,000 (larger than for 
the whole 1933 to 1979 period). There were several reasons for this. One was the way in 
which banks managed interest rate risk and we will talk about that in Chapter 9. Another 
reason was the reduction in oil and other commodity prices, which led to many loans to 
oil, gas, and agricultural companies not being repaid. 

A further reason for the bank failures was that the existence of deposit insurance 
allowed banks to follow risky strategies that would not otherwise be feasible. For example, 
they could increase their deposit base by offering high rates of interest to depositors and 
use the funds to make risky loans. Without deposit insurance, a bank could not follow this 
strategy because its depositors would see what the bank was doing, decide that the bank 
was too risky, and withdraw their funds. With deposit insurance, it can follow the strategy 
because depositors know that, if the worst happens, they are protected under FDIC. This 
is an example of what is known as moral hazard. We will talk about moral hazard further in 
Chapter 3. It can be defined as the possibility that the existence of insurance changes the 
behavior of the insured party. The introduction of risk-based deposit insurance premiums 
has reduced moral hazard to some extent. 

During the 1980s, the funds of FDIC became seriously depleted and it had to bor- 
row $30 billion from the U.S. Treasury. In December 1991, Congress passed the FDIC 
Improvement Act to prevent any possibility of the fund becoming insolvent in the future. 
Between 1991 and 2006, bank failures in the United States were relatively rare and by 
2006 the fund had reserves of about $50 billion. FDIC funds were again depleted by the 
banks that failed as a result of the credit crisis that started in 2007. However, by 2017 the 
reserves had been built up again. 


2.4 Investment Banking 


The main activity of investment banking is raising debt and equity financing for corpo- 
rations or governments. This involves originating the securities, underwriting them, and 
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then placing them with investors. In a typical arrangement a corporation approaches an 
investment bank indicating that it wants to raise a certain amount of financing in the 
form of debt, equity, or hybrid instruments such as convertible bonds. The securities are 
originated complete with legal documentation itemizing the rights of the security holder. 
A prospectus is created outlining the company’s past performance and future prospects. 
The risks faced by the company from such things as major lawsuits are included. There 
is a “road show” in which the investment bank and senior management from the com- 
pany attempt to market the securities to large fund managers. A price for the securities is 
agreed between the bank and the corporation. The bank then sells the securities in the 
market. 

There are a number of different types of arrangement between the investment bank 
and the corporation. Sometimes the financing takes the form of a private placement in 
which the securities are sold to a small number of large institutional investors, such as 
life insurance companies or pension funds, and the investment bank receives a fee. On 
other occasions it takes the form of a public offering, where securities are offered to the 
general public. A public offering may be on a best efforts or firm commitment basis. In the 
case of a best efforts public offering, the investment bank does as well as it can to place 
the securities with investors and is paid a fee that depends, to some extent, on its success. 
In the case of a firm commitment public offering, the investment bank agrees to buy 
the securities from the issuer at a particular price and then attempts to sell them in the 
market for a slightly higher price. It makes a profit equal to the difference between the 
price at which it sells the securities and the price it pays the issuer. If for any reason it is 
unable to sell the securities, it ends up owning them itself. The difference between the 
two arrangements is illustrated in Example 2.1. 


Example 2.1 


A bank has agreed to underwrite an issue of 50 million shares by ABC Corporation. In 
negotiations between the bank and the corporation the target price to be received by the 
corporation has been set at $30 per share. This means that the corporation is expecting 
to raise 30 X 50 million dollars or $1.5 billion in total. The bank can either offer the 
client a best efforts arrangement where it charges a fee of $0.30 per share sold so that, 
assuming all shares are sold, it obtains a total fee of 0.3 X 50 = $15 million. Alternatively, 
it can offer a firm commitment where it agrees to buy the shares from ABC Corporation 
for $30 per share. 

The bank is confident that it will be able to sell the shares, but is uncertain about the 
price. As part of its procedures for assessing risk, it considers two alternative scenarios. 
Under the first scenario, it can obtain a price of $32 per share; under the second scenario, 
it is able to obtain only $29 per share. 

In a best-efforts deal, the bank obtains a fee of $15 million in both cases. In a firm 
commitment deal, its profit depends on the price it is able to obtain. If it sells the shares 
for $32, it makes a profit of (32 — 30) x 50 = $100 million because it has agreed to pay 
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ABC Corporation $30 per share. However, if it can only sell the shares for $29 per 
share, it loses (30 — 29) X 50 = $50 million because it still has to pay ABC Corporation 
$30 per share. The situation is summarized in the table below. The decision taken is likely 
to depend on the probabilities assigned by the bank to different outcomes and what is 
referred to as its “risk appetite” (see Section 27.1). 


Profits If Profits If 
Best Efforts Firm Commitment 
Can sell at $29 +$15 million —$50 million 
Can sell at $32 +$15 million +$100 million 


When equity financing is being raised and the company is already publicly traded, 
the investment bank can look at the prices at which the company’s shares are trading a 
few days before the issue is to be sold as a guide to the issue price. Typically it will agree 
to attempt to issue new shares at a target price slightly below the current price. The main 
risk then is that the price of the company’s shares will show a substantial decline before 
the new shares are sold. 


2.4.1 IPOs 


When the company wishing to issue shares is not publicly traded, the share issue is known 
as an initial public offering (IPO). This type of offering is typically made on a best efforts 
basis. The correct offering price is difficult to determine and depends on the invest- 
ment bank’s assessment of the company’s value. The bank’s best estimate of the market 
price is its estimate of the company’s value divided by the number of shares currently 
outstanding. However, the bank will typically set the offering price below its best esti- 
mate of the market price. This is because it does not want to take the chance that the 
issue will not sell. (It typically earns the same fee per share sold regardless of the offering 
price.) 

Often there is a substantial increase in the share price immediately after shares are 
sold in an IPO (sometimes as much as 40%), indicating that the company could have 
raised more money if the issue price had been higher. As a result, IPOs are considered 
attractive buys by many investors. Banks frequently offer IPOs to the fund managers who 
are their best customers and to senior executives of large companies in the hope that they 
will provide them with business. (The latter is known as “spinning” and is frowned upon 
by regulators.) 


2.4.2 Dutch Auction Approach 


A few companies have used a Dutch auction approach for their IPOs. As for a regular 
IPO, a prospectus is issued and usually there is a road show. Individuals and companies 
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bid by indicating the number of shares they want and the price they are prepared to pay. 
Shares are first issued to the highest bidder, then to the next highest bidder, and so on, 
until all the shares have been sold. The price paid by all successful bidders is the lowest 
bid that leads to a share allocation. This is illustrated in Example 2.2. 


Example 2.2 


A company wants to sell one million shares in an IPO. It decides to use the Dutch auction 
approach. The bidders are shown in the table below. In this case, shares are allocated first 
to C, then to F then to E, then to H, then to A. At this point, 800,000 shares have been 
allocated. The next highest bidder is D, who has bid for 300,000 shares. Because only 
200,000 shares remain unallocated, D’s order is only two-thirds filled. The price paid by 
all the investors to whom shares are allocated (A, C, D, E, Ẹ and H) is the price bid by D, 
or $29.00. 


Bidder Number of Shares Price 
A 100,000 $30.00 
B 200,000 $28.00 
C 50,000 $33.00 
D 300,000 $29.00 
E 150,000 $30.50 
F 300,000 $31.50 
G 400,000 $25.00 
H 200,000 $30.25 


Dutch auctions potentially overcome two of the problems with a traditional IPO that 
we have mentioned. First, the price that clears the market ($29.00 in Example 2.2) should 
be the market price if all potential investors have participated in the bidding process. 
Second, the situations where investment banks offer IPOs only to their favored clients 
are avoided. However, the company does not take advantage of the relationships that 
investment bankers have developed with large investors that usually enable the investment 
bankers to sell an IPO very quickly. One high-profile IPO that used a Dutch auction 
was the Google IPO in 2004. This is discussed in Business Snapshot 2.1. 


2.4.3 Advisory Services 


In addition to assisting companies with new issues of securities, investment banks offer 
advice to companies on mergers and acquisitions, divestments, major corporate restruc- 
turings, and so on. They will assist in finding merger partners and takeover targets or help 
companies find buyers for divisions or subsidiaries of which they want to divest them- 
selves. They will also advise the management of companies that are themselves merger 
or takeover targets. 
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BUSINESS SNAPSHOT 2.1 
Google’s IPO 


Google, developer of the well-known Internet search engine, decided to go 
public in 2004. It chose the Dutch auction approach. It was assisted by two 
investment banks, Morgan Stanley and Credit Suisse First Boston. The SEC gave 
approval for it to raise funds up to a maximum of $2,718,281,828. (Why the odd 
number? The mathematical constant e is 2.7182818 ...) The IPO method was 
not a pure Dutch auction because Google reserved the right to change the num- 
ber of shares that would be issued and the percentage allocated to each bidder 
when it saw the bids. 

Some investors expected the price of the shares to be as high as $120. But 
when Google saw the bids, it decided that the number of shares offered would be 
19,605,052 at a price of $85. This meant that the total value of the offering was 
19,605,052 x 85 or $1.67 billion. Investors who had bid $85 or above obtained 
74.2% of the shares they had bid for. The date of the IPO was August 19, 2004. 
Most companies would have given investors who bid $85 or more 100% of the 
amount they bid for and raised $2.25 billion, instead of $1.67 billion. Perhaps 
Google (stock symbol: GOOG) correctly anticipated it would have no difficulty 
in selling further shares at a higher price later. 

The initial market capitalization was $23.1 billion with over 90% of the shares 
being held by employees. These employees included the founders, Sergey Brin 
and Larry Page, and the CEO, Eric Schmidt. On the first day of trading, the shares 
closed at $100.34, 18% above the offer price, and there was a further 7% increase 
on the second day. Google’s issue therefore proved to be underpriced—but not 
as underpriced as some other IPOs of technology stocks where traditional IPO 
methods were used. 

The cost of Google’s IPO (fees paid to investment banks, etc.) was 2.8% of 
the amount raised. This compares with an average of about 4% for a regular IPO. 

There were some mistakes made and Google was lucky that these did not 
prevent the IPO from going ahead as planned. Sergey Brin and Larry Page gave 
an interview to Playboy magazine in April 2004. The interview appeared in the 
September issue. This violated SEC requirements that there be a “quiet period” 
with no promoting of the company’s stock in the period leading up to an IPO. 
To avoid SEC sanctions, Google had to include the Playboy interview (together 
with some factual corrections) in its SEC filings. Google also forgot to register 
23.2 million shares and 5.6 million stock options. 

Google’s stock price rose rapidly in the period after the IPO. Approximately 
one year later (in September 2005) it was able to raise a further $4.18 billion 
by issuing an additional 14,159,265 shares at $295. (Why the odd number? The 
mathematical constant m is 3.14159265 ...) 
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Sometimes banks suggest steps their clients should take to avoid a merger or takeover. 
These are known as poison pills. Examples of poison pills are: 


1. A potential target adds to its charter a provision where, if another company acquires 
one third of the shares, other shareholders have the right to sell their shares to that 
company for twice the recent average share price. 

2. A potential target grants to its key employees stock options that vest (i.e., can be exer- 
cised) in the event of a takeover. This is liable to create an exodus of key employees 
immediately after a takeover, leaving an empty shell for the new owner. 

3. A potential target adds to its charter provisions making it impossible for a new owner 
to get rid of existing directors for one or two years after an acquisition. 

4. A potential target issues preferred shares that automatically get converted to regular 
shares when there is a change in control. 

5. A potential target adds a provision where existing shareholders have the right to 
purchase shares at a discounted price during or after a takeover. 

6. A potential target changes the voting structure so that shares owned by management 
have more votes than those owned by others. 


Poison pills, which are illegal in many countries outside the United States, have to be 
approved by a majority of shareholders. Often shareholders oppose poison pills because 
they see them as benefiting only management. An unusual poison pill, tried by PeopleSoft 
to fight a takeover by Oracle, is explained in Business Snapshot 2.2. 

Valuation, strategy, and tactics are key aspects of the advisory services offered by an 
investment bank. For example, in advising Company A on a potential takeover of Com- 
pany B, it is necessary for the investment bank to value Company B and help Company 
A assess possible synergies between the operations of the two companies. It must also 
consider whether it is better to offer Company B’s shareholders cash or a share-for-share 
exchange (i.e., a certain number of shares in Company A in exchange for each share of 
Company B). What should the initial offer be? What does it expect the final offer that 
will close the deal to be? It must assess the best way to approach the senior managers 
of Company B and consider what the motivations of the managers will be. Will the 
takeover be a hostile one (opposed by the management of Company B) or friendly one 
(supported by the management of Company B)? In some instances there will be antitrust 
issues, and approval from some branch of government may be required. 


2.5 Securities Trading 


Banks often get involved in securities trading, providing brokerage services, and making 
a market in individual securities. In doing so, they compete with smaller securities firms 
that do not offer other banking services. The Dodd—Frank Act in the United States, 
which will be discussed later in this book, does not allow banks to engage in proprietary 
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BUSINESS SNAPSHOT 2.2 
PeopleSoft’s Poison Pill 


In 2003, the management of PeopleSoft, Inc., a company that provided human 
resource management systems, was concerned about a takeover by Oracle,a com- 
pany specializing in database management systems. It took the unusual step of 
guaranteeing to its customers that, if it were acquired within two years and prod- 
uct support was reduced within four years, its customers would receive a refund 
of between two and five times the fees paid for their software licenses. The hypo- 
thetical cost to Oracle was estimated at $1.5 billion. The guarantee was opposed 
by PeopleSoft’s shareholders. (It appears to be not in their interests.) PeopleSoft 
discontinued the guarantee in April 2004. 

Oracle did succeed in acquiring PeopleSoft in December 2004. Although 
some jobs at PeopleSoft were eliminated, Oracle maintained at least 90% of Peo- 
pleSoft’s product development and support staff. 


trading. In some other countries, proprietary trading is allowed, but it usually has to be 
organized so that losses do not affect depositors. 

Most large investment and commercial banks have extensive trading activities. Apart 
from proprietary trading (which may or may not be allowed), banks trade to provide 
services to their clients. (For example, a bank might enter into a derivatives transaction 
with a corporate client to help it reduce its foreign exchange risk.) They also trade 
(typically with other financial institutions) to hedge their risks. 

A broker assists in the trading of securities by taking orders from clients and arranging 
for them to be carried out on an exchange. Some brokers operate nationally, and some 
serve only a particular region. Some, known as full-service brokers, offer investment 
research and advice. Others, known as discount brokers, charge lower commissions, but 
provide no advice. Some offer online services, and some, such as E*Trade, provide a 
platform for customers to trade without a broker. 

A market maker facilitates trading by always being prepared to quote a bid (the price 
at which it is prepared to buy) and an offer (the price at which it is prepared to sell). 
When providing a quote, it does not know whether the person requesting the quote 
wants to buy or sell. The market maker makes a profit from the spread between the bid 
and the offer, but takes the risk that it will be left with a big long or short position and 
lose money. 

Many exchanges on which stocks, options, and futures trade use market makers. 
Typically, an exchange will specify a maximum level for the size of a market maker’s bid- 
offer spread (the difference between the offer and the bid). Banks have in the past been 
market makers for instruments such as forward contracts, swaps, and options trading in the 
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over-the-counter (OTC) market. (See Chapter 5 for a discussion of these instruments 
and the over-the-counter market.) The trading and market making of these types of 
instruments is now increasingly being carried out on electronic platforms that are known 
as swap execution facilities (SEFs) in the United States and organized trading facilities 
(OTFs) in Europe. (See Sections 16.5 and 17.2.) 


2.6 Potential Conflicts of Interest in Banking 


There are many potential conflicts of interest between commercial banking, securities 
services, and investment banking when they are all conducted under the same corporate 
umbrella. For example: 


1. When asked for advice by an investor, a bank might be tempted to recommend 
securities that the investment banking part of its organization is trying to sell. When 
it has a fiduciary account (i.e.,a customer account where the bank can choose trades 
for the customer), the bank can “stuff” difficult-to-sell securities into the account. 

2. A bank, when it lends money to a company, often obtains confidential information 
about the company. It might be tempted to pass that information to the mergers and 
acquisitions arm of the investment bank to help it provide advice to one of its clients 
on potential takeover opportunities. 

3. The research end of the securities business might be tempted to recommend a com- 
pany’s share as a “buy” in order to please the company’s management and obtain 
investment banking business. 

4. Suppose a commercial bank no longer wants a loan it has made to a company on its 
books because the confidential information it has obtained from the company leads 
it to believe that there is an increased chance of bankruptcy. It might be tempted to 
ask the investment bank to arrange a bond issue for the company, with the proceeds 
being used to pay off the loan. This would have the effect of replacing its loan with 
a loan made by investors who were less well informed. 


As a result of these types of conflicts of interest, some countries have in the past 
attempted to separate commercial banking from investment banking. The Glass-Steagall 
Act of 1933 in the United States limited the ability of commercial banks and investment 
banks to engage in each other’s activities. Commercial banks were allowed to continue 
underwriting Treasury instruments and some municipal bonds. They were also allowed 
to do private placements. But they were not allowed to engage in other activities such as 
public offerings. Similarly, investment banks were not allowed to take deposits and make 
commercial loans. 

In 1987, the Federal Reserve Board relaxed the rules somewhat and allowed banks 
to establish holding companies with two subsidiaries, one in investment banking and the 
other in commercial banking. The revenue of the investment banking subsidiary was 
restricted to being a certain percentage of the group’s total revenue. 
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In 1997, the rules were relaxed further so that commercial banks could acquire exist- 
ing investment banks. Finally, in 1999, the Financial Services Modernization Act was 
passed. This effectively eliminated all restrictions on the operations of banks, insurance 
companies, and securities firms. In 2007, there were five large investment banks in the 
United States that had little or no commercial banking interests. These were Goldman 
Sachs, Morgan Stanley, Merrill Lynch, Bear Stearns, and Lehman Brothers. In 2008, the 
credit crisis led to Lehman Brothers going bankrupt, Bear Stearns being taken over by 
JPMorgan Chase, and Merrill Lynch being taken over by Bank of America. Goldman 
Sachs and Morgan Stanley became bank holding companies with both commercial and 
investment banking interests. (As a result, they have had to subject themselves to more 
regulatory scrutiny.) The year 2008 therefore marked the end of an era for investment 
banking in the United States. 

We have not returned to the Glass-Steagall world where investment banks and com- 
mercial banks were kept separate. But increasingly banks are required to ring-fence 
their deposit-taking businesses so that they cannot be affected by losses in investment 


banking. 


2.7 Today’s Large Banks 


Today’s large banks operate globally and transact business in many different areas. They are 
still engaged in the traditional commercial banking activities of taking deposits, making 
loans, and clearing checks (both nationally and internationally). They offer retail cus- 
tomers credit cards, telephone banking, Internet banking, and automatic teller machines 
(ATMs). They provide payroll services to businesses and, as already mentioned, they have 
large trading activities. 

Banks offer lines of credit to businesses and individual customers. They provide a 
range of services to companies when they export goods and services. Companies can 
enter into a variety of contracts with banks that are designed to hedge risks they face 
relating to foreign exchange, commodity prices, interest rates, and other market variables. 
These contracts will be discussed in later chapters. Even risks related to the weather can 
be hedged. 


Banks undertake securities research and offer “buy, 
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sell,’ and “hold” recommenda- 
tions on individual stocks. They offer brokerage services (discount and full service). They 
offer trust services where they are prepared to manage portfolios of assets for clients. They 
have economics departments that consider macroeconomic trends and actions likely to be 
taken by central banks. These departments produce forecasts on interest rates, exchange 
rates, commodity prices, and other variables. Banks offer a range of mutual funds and 
in some cases have their own hedge funds. Increasingly banks are offering insurance 
products. 

The investment banking arm of a bank has complete freedom to underwrite securi- 
ties for governments and corporations. It can provide advice to corporations on mergers 
and acquisitions and other topics relating to corporate finance. 


40 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


How are the conflicts of interest outlined in Section 2.6 handled? There are inter- 
nal barriers known as Chinese walls. These internal barriers prohibit the transfer of 
information from one part of the bank to another when this is not in the best interests of 
one or more of the bank’s customers. There have been some well-publicized violations 
of conflict-of-interest rules by large banks. These have led to hefty fines and lawsuits. Top 
management has a big incentive to enforce Chinese walls. This is not only because of 
the fines and lawsuits. A bank’s reputation is its most valuable asset. The adverse publicity 
associated with conflict-of-interest violations can lead to a loss of confidence in the bank 
and business being lost in many different areas. 


2.7.1 Accounting 


It is appropriate at this point to provide a brief discussion of how a bank calculates a 
profit or loss from its many diverse activities. Activities that generate fees, such as most 
investment banking activities, are straightforward. Accrual accounting rules similar to 
those that would be used by any other business apply. 

For other banking activities, there is an important distinction between the “banking 
book” and the “trading book.’ As its name implies, the trading book includes all the 
assets and liabilities the bank has as a result of its trading operations. The values of these 
assets and liabilities are marked to market daily. This means that the value of the book 
is adjusted daily to reflect changes in market prices. If a bank trader buys an asset for 
$100 on one day and the price falls to $60 the next day, the bank records an immedi- 
ate loss of $40—even if it has not sold the asset. Sometimes it is not easy to estimate 
the value of a contract that has been entered into because there are no market prices 
for similar transactions. For example, there might be a lack of liquidity in the market or 
it might be the case that the transaction is a complex nonstandard derivative that does 
not trade sufficiently frequently for benchmark market prices to be available. Banks are 
nevertheless expected to come up with a market price in these circumstances. Often a 
model has to be assumed. The process of coming up with a “market price” is then some- 
times termed marking to model. (Chapter 25 discusses model risk and accounting issues 
further.) 

The banking book includes loans made to corporations and individuals. Traditionally 
these have not been marked to market. However, this is changing as a result of a new 
accounting standard from the International Accounting Standards Board, IFRS 9, and 
similar accounting updates from the Financial Accounting Standards Board in the United 
States. The new accounting rules require lenders to estimate the amount of credit losses 
expected in their loan portfolios and adjust the value of the loan portfolios accordingly. 
The new rules are partly a result of the financial crisis that started in 2007. Banks had 
portfolios of subprime mortgages that performed badly. However, losses did not have to 
be reported until they had been actually incurred. 

Occasionally banks have resorted to artificial ways of avoiding the recognition of 
loan losses, as outlined in Business Snapshot 2.3. 
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BUSINESS SNAPSHOT 2.3 
How to Avoid Loan Losses 


When a borrower is experiencing financial difficulties and is unable to make 
interest and principal payments as they become due, it is sometimes tempting to 
lend more money to the borrower so that the payments on the old loans can be 
kept up to date. No losses are then recorded. 

In the 1970s, banks in the United States and other countries lent huge 
amounts of money to Eastern European, Latin American, and other less devel- 
oped countries (LDCs). Some of the loans were made to help countries develop 
their infrastructure, but others were less justifiable (e.g., one was to finance the 
coronation of a ruler in Africa). Sometimes the money found its way into the 
pockets of dictators. For example, the Marcos family in the Philippines allegedly 
transferred billions of dollars into its own bank accounts. 

In the early 1980s, many LDCs were unable to service their loans. One 
option for them was debt repudiation, but a more attractive alternative was debt 
rescheduling, where more money was lent to cover required payments on existing 
loans. 

In 1987, Citicorp (now Citigroup) took the lead in refusing to reschedule 
LDC debt and increased its loan loss reserves by $3 billion in recognition of 
expected losses on the debt. Other banks with large LDC exposures followed suit. 


2.7.2 The Originate-to-Distribute Model 


DLC, the small hypothetical bank we looked at in Tables 2.2 to 2.4, took deposits and 
used them to finance loans. An alternative approach is known as the originate-to-distribute 
model. This involves the bank originating but not keeping loans. Portfolios of loans are 
packaged into tranches that are then sold to investors. 

The originate-to-distribute model has been used in the U.S. mortgage market for 
many years. In order to increase the liquidity of the U.S. mortgage market and facilitate 
the growth of home ownership, three government sponsored entities have been created: 
the Government National Mortgage Association (GNMA) or “Ginnie Mae,” the Federal 
National Mortgage Association (FNMA) or “Fannie Mae,” and the Federal Home Loan 
Mortgage Corporation (FHLMC) or “Freddie Mac.” These agencies buy pools of mort- 
gages from banks and other mortgage originators, guarantee the timely repayment of 
interest and principal, and then package the cash flow streams and sell them to investors. 
The investors typically take what is known as prepayment risk. This is the risk that inter- 
est rates will decrease and mortgages will be paid off earlier than expected. However, they 
do not take any credit risk because the mortgages are guaranteed by GNMA, FNMA, or 
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FHLMC. In 1999, the agencies started to guarantee subprime loans and as a result ran 
into serious financial difficulties? 

The originate-to-distribute model has been used for many types of bank lending, 
including student loans, commercial loans, commercial mortgages, residential mortgages, 
and credit card receivables. In many cases there is no guarantee that payment will be made 
so that it is the investors who bear the credit risk when the loans are packaged and sold. 

The originate-to-distribute model is also termed securitization because securities are 
created from cash flow streams originated by the bank. It is an attractive model for banks. 
By securitizing its loans it gets them off the balance sheet and frees up funds to enable 
it to make more loans. It also frees up capital that can be used to cover risks being taken 
elsewhere in the bank. (This is particularly attractive if the bank feels that the capital 
required by regulators for a loan is too high.) A bank earns a fee for originating a loan 
and a further fee if it services the loan after it has been sold. 

As we will explain in Chapter 6, the originate-to-distribute model got out of 
control during the 2000 to 2006 period. Banks relaxed their mortgage lending standards 
and the credit quality of the instruments being originated declined sharply. This led to 
a severe credit crisis and a period during which the originate-to-distribute model could 
not be used by banks because investors had lost confidence in the securities that had 
been created. 


2.8 The Risks Facing Banks 


A bank’s operations give rise to many risks. Much of the rest of this book is devoted to 
considering these risks in detail. 

Central bank regulators require banks to hold capital for the risks they are bearing. 
In 1988, international standards were developed for the determination of this capital. 
These standards and the way they have evolved since 1988 are discussed in Chapters 15, 
16, and 17. Capital is now required for three types of risk: credit risk, market risk, and 
operational risk. 

Credit risk is the risk that counterparties in loan transactions and derivatives transac- 
tions will default. This has traditionally been the greatest risk facing a bank and is usually 
the one for which the most regulatory capital is required. Market risk arises primarily 
from the bank’s trading operations. It is the risk relating to the possibility that instru- 
ments in the bank’s trading book will decline in value. Operational risk, which is often 
considered to be the biggest risk facing banks, is the risk that losses are created because 
internal systems fail to work as they are supposed to or because of external events. The 
time horizon used by regulators for considering losses from credit risks and operational 
risks is one year, whereas the time horizon for considering losses from market risks is 


?GNMA has always been government owned whereas FNMA and FHLMC used to be private cor- 
porations with shareholders. As a result of their financial difficulties in 2008, the U.S. government had 
to step in and assume complete control of FNMA and FHLMC. 
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usually much shorter. The objective of regulators is to keep the total capital of a bank 
sufficiently high that the chance of a bank failure is very low. For example, in the case 
of credit risk and operational risk, the capital is chosen so that the chance of unexpected 
losses exceeding the capital in a year is 0.1%. 

In addition to calculating regulatory capital, most large banks have systems in place 
for calculating what is termed economic capital (see Chapter 26). This is the capital that the 
bank, using its own models rather than those prescribed by regulators, thinks it needs. 
Economic capital is often less than regulatory capital. However, banks have no choice but 
to maintain their capital above the regulatory capital level. The form the capital can take 
(equity, subordinated debt, etc.) is prescribed by regulators. To avoid having to raise capital 
at short notice, banks try to keep their capital comfortably above the regulatory minimum. 

When banks announced huge losses on their subprime mortgage portfolios in 2007 
and 2008, many had to raise new equity capital in a hurry. Sovereign wealth funds, which are 
investment funds controlled by the government of a country, have provided some of this 
capital. For example, Citigroup, which reported losses in the region of $40 billion, raised 
$7.5 billion in equity from the Abu Dhabi Investment Authority in November 2007 and 
$14.5 billion from investors that included the governments of Singapore and Kuwait in 
January 2008. Later, Citigroup and many other banks required capital injections from 
their own governments to survive. 


Summary 


Banks are complex global organizations engaged in many different types of activities. 
Today, the world’s large banks are engaged in taking deposits, making loans, underwriting 
securities, trading, providing brokerage services, providing fiduciary services, advising on 
a range of corporate finance issues, offering mutual funds, providing services to hedge 
funds, and so on. There are potential conflicts of interest and banks develop internal rules 
to avoid them. It is important that senior managers are vigilant in ensuring that employees 
obey these rules. The cost in terms of reputation, lawsuits, and fines from inappropriate 
behavior where one client (or the bank) is advantaged at the expense of another client 
can be very high. 

There are now international agreements on the regulation of banks. This means that 
the capital banks are required to keep for the risks they are bearing does not vary too 
much from one country to another. Many countries have guaranty programs that protect 
small depositors from losses arising from bank failures. This has the effect of maintaining 
confidence in the banking system and avoiding mass withdrawals of deposits when there 
is negative news (or perhaps just a rumor) about problems faced by a particular bank. 


Further Reading 


Saunders, A. and M. M. Cornett. Financial Institutions Management: A Risk Management Approach. 9th 
ed. New York: McGraw-Hill, 2017. 
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Practice Questions and Problems 
(Answers at End of Book) 


How did concentration in the U.S. banking system change between 1984 and 2017? 
What government policies led to the large number of small community banks in 
the United States? 

What risks does a bank take if it funds long-term loans with short-term deposits? 
Suppose that an out-of-control trader working for DLC Bank (see Tables 2.2 and 
2.3) loses $7 million trading foreign exchange. What do you think would happen? 
What is meant by net interest income? 

Which items on the income statement of DLC Bank in Section 2.2 are most likely 
to be affected by (a) credit risk, (b) market risk, and (c) operational risk? 

Explain the terms “private placement” and “public offering.’ What is the difference 
between “best efforts” and “firm commitment” for a public offering? 

The bidders in a Dutch auction are as follows: 


Bidder Number of Shares Price 

A 20,000 $100.00 
B 30,000 $93.00 
C 50,000 $110.00 
D 70,000 $88.00 
E 60,000 $80.00 
F 10,000 $105.00 
G 90,000 $70.00 
H 80,000 $125.00 


The number of shares being auctioned is 150,000. What is the price paid by 
investors? How many shares does each investor receive? 

What is the attraction of a Dutch auction over the normal procedure for an IPO? 
In what ways was Google’s IPO different from a standard Dutch auction? 
Management sometimes argues that poison pills are in the best interests of share- 
holders because they enable management to extract a higher price from would-be 
acquirers. Discuss this argument. 

Give three examples of the conflicts of interest in a large bank. How are conflicts 
of interest handled? 

What is the difference between the banking book and the trading book? 

How has accounting for loans changed since the 2007—2008 financial crisis? 
What is the originate-to-distribute model? 


Further Questions 


Regulators calculate that DLC bank (see Section 2.2) will report a profit that is nor- 
mally distributed with a mean of $0.6 million and a standard deviation of $2 million. 
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How much equity capital in addition to that in Table 2.2 should regulators require 
for there to be a 99.9% chance of the capital not being wiped out by losses? 

2.16 Explain the moral hazard problems with deposit insurance. How can they be 
overcome? 

2.17 The bidders in a Dutch auction are as follows: 


Bidder Number of Shares Price 


A 60,000 $50.00 
B 20,000 $80.00 
C 30,000 $55.00 
D 40,000 $38.00 
E 40,000 $42.00 
F 40,000 $42.00 
G 50,000 $35.00 
H 50,000 $60.00 


The number of shares being auctioned is 210,000. What is the price paid by 
investors? How many shares does each investor receive? 

2.18 An investment bank has been asked to underwrite an issue of 10 million shares by 
a company. It is trying to decide between a firm commitment where it buys the 
shares for $10 per share and a best efforts where it charges a fee of 20 cents for each 
share sold. Explain the pros and cons of the two alternatives. 


Chapter 3 


Insurance Companies 
and Pension Plans 


T he role of insurance companies is to provide protection against adverse events. 
The company or individual seeking protection is referred to as the policyholder. 
The policyholder makes regular payments, known as premiums, and receives pay- 
ments from the insurance company if certain specified events occur. Insurance is usually 
classified as life insurance and nonlife insurance, with health insurance often considered to 
be a separate category. Nonlife insurance is also referred to as property-casualty insurance, 
and this is the terminology we will use here. 

A life insurance contract typically lasts a long time and provides payments to the 
policyholder’s beneficiaries that depend on when the policyholder dies. A property- 
casualty insurance contract typically lasts one year (although it may be renewed) and 
provides compensation for losses from accidents, fire, theft, and so on. 

Insurance has existed for many years. As long ago as 200 B.C., there was an arrange- 
ment in ancient Greece where an individual could make a lump-sum payment (the 
amount dependent on his or her age) and obtain a monthly income for life. The Romans 
had a form of life insurance where an individual could purchase a contract that would 
provide a payment to relatives on his or her death. In ancient China, a form of property- 
casualty insurance existed between merchants where, if the ship of one merchant sank, 
the rest of the merchants would provide compensation. 

A pension plan is a form of insurance arranged by a company for its employees. 
It is designed to provide the employees with income for the rest of their lives once 
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they have retired. Typically both the company and its employees make regular monthly 
contributions to the plan and the funds in the plan are invested to provide income for 
retirees. 

This chapter describes how the contracts offered by insurance companies work. It 
explains the risks that insurance companies face and the way they are regulated. It also 
discusses key issues associated with pension plans. 


3.1 Life Insurance 


In life insurance contracts, the payments to the policyholder depend—at least to some 
extent—on when the policyholder dies. Outside the United States, the term life assurance 
is often used to describe a contract where the event being insured against is certain to 
happen at some future time (e.g., a contract that will pay $100,000 on the policyholder’s 
death). Life insurance is used to describe a contract where the event being insured against 
may never happen (for example, a contract that provides a payoff in the event of the 
accidental death of the policyholder). In the United States, all types of life policies are 
referred to as life insurance and this is the terminology that will be adopted here. 

There are many different types of life insurance products. The products available vary 
from country to country. We will now describe some of the more common ones. 


3.1.1 Term Life Insurance 


Term life insurance (sometimes referred to as temporary life insurance) lasts a predetermined 
number of years. If the policyholder dies during the life of the policy, the insurance 
company makes a payment to the specified beneficiaries equal to the face amount of the 
policy. If the policyholder does not die during the term of the policy, no payments are 
made by the insurance company. The policyholder is required to make regular monthly 
or annual premium payments (usually constant predetermined amounts) to the insurance 
company for the life of the policy or until the policyholder’s death (whichever is earlier). 
The face amount of the policy typically stays the same or declines with the passage of 
time. A policy where the premium per year is not constant is an annual renewable term 
policy. In this, the insurance company guarantees to renew the policy from one year to 
the next at a rate reflecting the policyholder’s age without regard to the policyholder’s 
health. 

A common reason for term life insurance is a mortgage. For example, a person age 35 
with a 25-year mortgage might choose to buy 25-year term insurance (with a declining 
face amount) to provide dependents with the funds to pay off the mortgage in the event 
of his or her death. 


3.1.2 Whole Life Insurance 


Whole life insurance (sometimes referred to as permanent life insurance) provides protection 
for the life of the policyholder. The policyholder is required to make regular monthly or 
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annual payments until his or her death. The face value of the policy is then paid to the 
designated beneficiary. In the case of term life insurance, there is no certainty that there 
will be a payout, but in the case of whole life insurance, a payout is certain to happen 
providing the policyholder continues to make the agreed premium payments. The only 
uncertainty is when the payout will occur. Not surprisingly, whole life insurance requires 
considerably higher premiums than term life insurance policies. Usually, the payments and 
the face value of the policy both remain constant through time. 

Policyholders can often redeem (surrender) whole life policies early or use the poli- 
cies as collateral for loans. When a policyholder wants to redeem a whole life policy early, 
it is sometimes the case that an investor will buy the policy from the policyholder for 
more than the surrender value offered by the insurance company. The investor will then 
make the premium payments and collect the face value from the insurance company 
when the policyholder dies. 

The annual premium for a year can be compared with the cost of providing term life 
insurance for that year. Consider a man who buys a $1 million whole life policy at age 
40. Suppose that the premium is $20,000 per year. As we will see later, the probability 
of a male age 40 dying within one year is about 0.00209, suggesting that a fair premium 
for one-year insurance is about $2,090. This means that there is a surplus premium of 
$17,910 available for investment from the first year’s premium. The probability of a man 
age 41 dying in one year is about 0.00224, suggesting that a fair premium for insurance 
during the second year is $2,240. This means that there is a $17,760 surplus premium 
available for investment from the second year’s premium. The cost of a one-year policy 
continues to rise as the individual gets older so that at some stage it is greater than the 
annual premium. In our example, this would have happened by the 30th year because the 
probability of a man age 70 dying in one year is 0.02353. (A fair premium for the 30th 
year is $23,530, which is more than the $20,000 received.) The situation is illustrated 
in Figure 3.1. The surplus during the early years is used to fund the deficit during later 
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Figure 3.1 Cost of Life Insurance per Year Compared with the Annual Premium in a Whole Life 
Contract 
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years. There is a savings element to whole life insurance. In the early years, the part 
of the premium not needed to cover the risk of a payout is invested on behalf of the 
policyholder by the insurance company. 

There are tax advantages associated with life insurance policies in many countries. 
If the policyholder invested the surplus premiums, tax would normally be payable on 
the income as it was earned. But, when the surplus premiums are invested within 
the insurance policy, the tax treatment is often better. Tax is deferred, and some- 
times the payout to the beneficiaries of life insurance policies is free of income tax 
altogether. 


3.1.3 Variable Life Insurance 


Given that a whole life insurance policy involves funds being invested for the policy- 
holder, a natural development is to allow the policyholder to specify how the funds are 
invested. Variable life (VL) insurance is a form of whole life insurance where the sur- 
plus premiums discussed earlier are invested in a fund chosen by the policyholder. This 
could be an equity fund, a bond fund, or a money market fund. A minimum guaranteed 
payout on death is usually specified, but the payout can be more if the fund does well. 
Income earned from the investments can sometimes be applied toward the premiums. 
The policyholder can usually switch from one fund to another at any time. 


3.1.4 Universal Life 


Universal life (UL) insurance is also a form of whole life insurance. The policyholder 
can reduce the premium down to a specified minimum without the policy lapsing. The 
surplus premiums are invested by the insurance company in fixed income products such 
as bonds, mortgages, and money market instruments. The insurance company guaran- 
tees a certain minimum return, say 4%, on these funds. The policyholder can choose 
between two options. Under the first option, a fixed benefit is paid on death; under the 
second option, the policyholder’s beneficiaries receive more than the fixed benefit if the 
investment return is greater than the guaranteed minimum. Needless to say, premiums 
are lower for the first option. 


3.1.5 Variable-Universal Life Insurance 


Variable-universal life (VUL) insurance blends the features found in variable life insur- 
ance and universal life insurance. The policyholder can choose between a number of 
alternatives for the investment of surplus premiums. The insurance company guaran- 
tees a certain minimum death benefit and interest on the investments can sometimes 
be applied toward premiums. Premiums can be reduced down to a specified minimum 
without the policy lapsing. 
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3.1.6 Endowment Life Insurance 


Endowment life insurance lasts for a specified period and pays a lump sum either when 
the policyholder dies or at the end of the period, whichever is first. There are many 
different types of endowment life insurance contracts. The amount that is paid out can 
be specified in advance as the same regardless of whether the policyholder dies or sur- 
vives to the end of the policy. Sometimes the payout is also made if the policyholder has 
a critical illness. In a with-profits endowment life insurance policy, the insurance company 
declares periodic bonuses that depend on the performance of the insurance company’s 
investments. These bonuses accumulate to increase the amount paid out to the policy- 
holder, assuming the policyholder lives beyond the end of the life of the policy. In a 
unit-linked endowment, the amount paid out at maturity depends on the performance 
of the fund chosen by the policyholder. A pure endowment policy has the property that a 
payout occurs only if the policyholder survives to the end of the life of the policy. 


3.1.7 Group Life Insurance 


Group life insurance covers many people under a single policy. It is often purchased by a 
company for its employees. The policy may be contributory, where the premium payments 
are shared by the employer and employee, or noncontributory, where the employer pays 
the whole of the cost. There are economies of scale in group life insurance. The selling 
and administration costs are lower. An individual is usually required to undergo medical 
tests when purchasing life insurance in the usual way, but this may not be necessary 
for group life insurance. The insurance company knows that it will be taking on some 
better-than-average risks and some worse-than-average risks. 


3.2 Annuity Contracts 


Many life insurance companies also offer annuity contracts. Where a life insurance con- 
tract has the effect of converting regular payments into a lump sum, an annuity contract 
has the opposite effect: that of converting a lump sum into regular payments. In a typical 
arrangement, the policyholder makes a lump-sum payment to the insurance company 
and the insurance company agrees to provide the policyholder with an annuity that starts 
at a particular date and lasts for the rest of the policyholder’s life. In some instances, the 
annuity starts immediately after the lump-sum payment by the policyholder. More usu- 
ally, the lump-sum payment is made by the policyholder several years ahead of the time 
when the annuity is to start and the insurance company invests the funds to create the 
annuity. (This is referred to as a deferred annuity.) Instead of a lump sum, the policyholder 
sometimes saves for the annuity by making regular monthly, quarterly, or annual payments 
to the insurance company. 

There are often tax deferral advantages to the policyholder. This is because taxes 
usually have to be paid only when the annuity income is received. The amount to which 
the funds invested by the insurance company on behalf of the policyholder have grown 
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in value is sometimes referred to as the accumulation value. Funds can usually be with- 
drawn early, but there are liable to be penalties. In other words, the surrender value of an 
annuity contract is typically less than the accumulation value. This is because the insur- 
ance company has to recover selling and administration costs. Policies sometimes allow 
penalty-free withdrawals where a certain percentage of the accumulation value or a certain 
percentage of the original investment can be withdrawn in a year without penalty. In 
the event that the policyholder dies before the start of the annuity (and sometimes in 
other circumstances such as when the policyholder is admitted to a nursing home), the 
full accumulation value can often be withdrawn without penalty. 

Some deferred annuity contracts in the United States have embedded options. The 
accumulation value is sometimes calculated so that it tracks a particular equity index, 
such as the S&P 500. Lower and upper limits are specified. If the growth in the index in 
a year is less than the lower limit, the accumulation value grows at the lower limit rate; 
if it is greater than the upper limit, the accumulation value grows at the upper limit rate; 
otherwise it grows at the same rate as the S&P 500. Suppose that the lower limit is 0% and 
the upper limit is 8%. The policyholder is assured that the accumulation value will never 
decline, but index growth rates in excess of 8% are given up. In this type of arrangement, 
the policyholder is typically not compensated for dividends that would be received from 
an investment in the stocks underlying the index and the insurance company may be 
able to change parameters such as the lower limit and the upper limit from one year 
to the next. These types of contracts appeal to investors who want an exposure to the 
equity market but are reluctant to risk a decline in their accumulation value. Sometimes, 
the way the accumulation value grows from one year to the next is a quite complicated 
function of the performance of the index during the year. 

In the United Kingdom, the annuity contracts offered by insurance companies used 
to guarantee a minimum level for the interest rate used for the calculation of the size of the 
annuity payments. Many insurance companies regarded this guarantee—an interest rate 
option granted to the policyholder—as a necessary marketing cost and did not calculate 
the cost of the option or hedge their risks. As interest rates declined and life expectancies 
increased, many insurance companies found themselves in financial difficulties and, as 
described in Business Snapshot 3.1, at least one of them went bankrupt. 


3.3 Mortality Tables 


Mortality tables are the key to valuing life insurance contracts. Table 3.1 shows an extract 
from the mortality rates estimated by the U.S. Department of Social Security from statis- 
tics on death rates as a function of age collected in 2013. To understand the table, consider 
the row corresponding to age 31. The second column shows that the probability of a 
man who has just reached age 31 dying within the next year is 0.001505 (or 0.1505%). 
The third column shows that the probability of a man surviving to age 31 is 0.97376 (or 
97.376%). The fourth column shows that a man age 31 has a remaining life expectancy 
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BUSINESS SNAPSHOT 3.1 
Equitable Life 


Equitable Life was a British life insurance company founded in 1762 that at 
its peak had 1.5 million policyholders. Starting in the 1950s, Equitable Life 
sold annuity products where it guaranteed that the interest rate used to cal- 
culate the size of the annuity payments would be above a certain level. (This is 
known as a Guaranteed Annuity Option, GAO.) The guaranteed interest rate was 
gradually increased in response to competitive pressures and increasing interest 
rates. Toward the end of 1993, interest rates started to fall. Also, life expectancies 
were rising so that the insurance companies had to make increasingly high provi- 
sions for future payouts on contracts. Equitable Life did not take action. Instead, 
it grew by selling new products. In 2000, it was forced to close its doors to new 
business. A report issued by Ann Abraham in July 2008 was highly critical of 
regulators and urged compensation for policyholders. 

An interesting aside to this is that regulators did at one point urge insurance 
companies that offered GAOs to hedge their exposures to an interest rate decline. 
As a result, many insurance companies scrambled to enter into contracts with 
banks that paid off if long-term interest rates declined. The banks in turn hedged 
their risk by buying instruments such as bonds that increased in price when 
rates fell. This was done on such a massive scale that the extra demand for bonds 
caused long-term interest rates in the UK to decline sharply (increasing losses for 
insurance companies on the unhedged part of their exposures). This shows that 
when large numbers of different companies have similar exposures, problems are 
created if they all decide to hedge at the same time. There are not likely to be 
enough investors willing to take on their risks without market prices changing. 


of 46.89 years. This means than on average he will live to age 77.89. The remaining three 
columns show similar statistics for a woman. The probability of a 31-year-old woman 
dying within one year is 0.000705 (0.0705%), the probability of a woman surviving to 
age 31 is 0.98569 (98.569%), and the remaining life expectancy for a 31-year-old woman 
is 51.04 years. 

The full table shows that the probability of death during the following year is a 
decreasing function of age for the first 10 years of life and then starts to increase. Mortality 
statistics for women are a little more favorable than for men. If a man is lucky enough 
to reach age 90, the probability of death in the next year is about 16.7%. The full table 
shows this probability is about 35.4% at age 100 and 57.6% at age 110. For women, the 
corresponding probabilities are 13.2%, 30.5%, and 54.6%, respectively. 
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Table 3.1 Mortality Table 


Male Female 
Probability Probability 

Age of Death Survival Life of Death Survival Life 
(Years) within 1 Year Probability Expectancy within 1 Year Probability Expectancy 

0 0.006519 1.00000 76.28 0.005377 1.00000 81.05 

1 0.000462 0.99348 75.78 0.000379 0.99462 80.49 

2 0.000291 0.99302 74.82 0.000221 0.99425 79.52 

3 0.000209 0.99273 73.84 0.000162 0.99403 78.54 
30 0.001467 0.97519 47.82 0.000664 0.98635 52.01 
31 0.001505 0.97376 46.89 0.000705 0.98569 51.04 
32 0.001541 0.97230 45.96 0.000748 0.98500 50.08 
33 0.001573 0.97080 45.03 0.000794 0.98426 49.11 
40 0.002092 0.95908 38.53 0.001287 0.97753 42.43 
41 0.002240 0.95708 37.61 0.001393 0.97627 41.48 
42 0.002418 0.95493 36.70 0.001517 0.97491 40.54 
43 0.002629 0.95262 35.78 0.001662 0.97343 39.60 
50 0.005038 0.92940 29.58 0.003182 0.95829 33.16 
51 0.005520 0.92472 28.73 0.003473 0.95524 32.27 
52 0.006036 0.91961 27.89 0.003767 0.95193 31.38 
53 0.006587 0.91406 27.05 0.004058 0.94834 30.49 
60 0.011197 0.86112 21.48 0.006545 0.91526 24.46 
61 0.012009 0.85147 20.72 0.007034 0.90927 23.62 
62 0.012867 0.84125 19.97 0.007607 0.90287 22.78 
63 0.013772 0.83042 19.22 0.008281 0.89600 21.95 
70 0.023528 0.73461 14.24 0.015728 0.82864 16.43 
71 0.025693 0.71732 13.57 0.017338 0.81561 15.68 
72 0.028041 0.69889 12.92 0.019108 0.80147 14.95 
73 0.030567 0.67930 12.27 0.021041 0.78616 14.23 
80 0.059403 0.50629 8.20 0.043289 0.63880 9.64 
81 0.065873 0.47621 7.68 0.048356 0.61114 9.05 
82 0.073082 0.44484 7.19 0.054041 0.58159 8.48 
83 0.081070 0.41233 6.72 0.060384 0.55016 7.94 
90 0.167291 0.17735 4.03 0.132206 0.29104 4.80 
91 0.184520 0.14768 3.74 0.147092 0.25257 4.45 
92 0.202954 0.12043 3.47 0.163154 0.21542 4.13 
93 0.222555 0.09599 3.23 0.180371 0.18027 3.84 


Source: U.S. Department of Social Security, www.ssa.gov/OACT/STATS/table4c6. html. 
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All the numbers in the table can be calculated from the “Probability of Death within 
1 Year” column. The probability that an individual will survive to age n+ 1 is the prob- 
ability that the individual will survive to age n multiplied by 1 minus the probability 
that an individual aged n will die within the next year. For example, the probability of 
surviving to age 61 can be calculated as 0.86112 x (1 — 0.011197). 

Next consider life expectancy for an individual of a certain age. This can be deter- 
mined from the probability of death during the first year, the second year, the third year, 
and so on. Consider a man age 90. The probability that the man will die within one year 
is, from the table, 0.167291. The probability that he will die in the second year (between 
ages 91 and 92) is the probability that he does not die in the first year multiplied by the 
probability that he does die in the second year. From the numbers in the second column 
of the table, this is 


(1 — 0.167291) x 0.184520 = 0.153651 
Similarly, the probability that he dies in the third year (between ages 92 and 93) is 
(1 — 0.167291) x (1 — 0.184520) x 0.202954 = 0.137817 


Assuming that death occurs on average halfway through a year, the life expectancy of a 
man age 90 is 


0.5 X 0.167291 + 1.5 X 0.153651 + 2.5 X 0.137817 + «+ 


Example 3.1 


Assume that interest rates for all maturities are 4% per annum (with semiannual com- 
pounding) and premiums are paid once a year at the beginning of the year. What is 
an insurance company’s break-even premium for $100,000 of term life insurance for a 
man of average health age 90? If the term insurance lasts one year, the expected pay- 
out is 0.167291 X 100,000 or $16,729. Assume that the payout occurs halfway through 
the year. (This is likely to be approximately true on average.) The premium is $16,729 
discounted for six months. This is 16,729/1.02 or $16,401. 

Suppose next that the term insurance lasts two years. In this case, the present value 
of expected payout in the first year is $16,401 as before. The probability that the policy- 
holder dies during the second year is (1 — 0.167291) X 0.184520 = 0.153651 so that 
there is also an expected payout of 0.153651 x 100,000 or $15,365 during the sec- 
ond year. Assuming this happens at time 18 months, the present value of the payout 
is 15,365/(1.023) or $14,479. The total present value of payouts is 16,401 + 14,479 or 
$30,880. 

Consider next the premium payments. The first premium is required at time zero, 
so we are certain that this will be paid. The probability of the second premium payment 
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being made at the beginning of the second year is the probability that the man does 
not die during the first year. This is 1 — 0.167291 = 0.832709. When the premium is X 
dollars per year, the present value of the premium payments is 


age BEANS 24 800874 
(1.02)2 


The break-even annual premium is given by the value of X that equates the present value 
of the expected premium payments to the present value of the expected payout. This is 
the value of X that solves 


1.800374X = 30,880 


or X = 17,152. The break-even premium payment is therefore $17,152. 


3.4 Longevity and Mortality Risk 


Longevity risk is the risk that advances in medical sciences and lifestyle changes will lead 
to people living longer. Increases in longevity adversely affect the profitability of most 
types of annuity contracts (because the annuity has to be paid for longer), but increase the 
profitability of most life insurance contracts (because the final payout is either delayed 
or, in the case of term insurance, less likely to happen). Life expectancy varies from 
country to country, but has been steadily increasing in most parts of the world. Average 
life expectancy of a child born in the United States in 2013 is estimated to be about 
20 years higher than for a child born in 1929. The statistics in Table 3.1 are based on 
the death rates observed in 2013 for people of different ages. If people continue to live 
longer, Table 3.1 understates life expectancies. 

Mortality risk is the risk that wars, epidemics such as AIDS, or pandemics such as 
Spanish flu will lead to people living not as long as expected. This adversely affects the 
payouts on most types of life insurance contracts (because the insured amount has to 
be paid earlier than expected), but should increase the profitability of annuity contracts 
(because the annuity is not paid out for as long). In calculating the impact of mortality 
risk, it is important to consider the age groups within the population that are likely to 
be most affected by a particular event. 

To some extent, the longevity and mortality risks in the annuity business of a life 
insurance company offset those in its regular life insurance contracts. Actuaries must 
carefully assess the insurance company’s net exposure under different scenarios. If the 
exposure is unacceptable, they may decide to enter into reinsurance contracts for some 
of the risks. Reinsurance is discussed later in this chapter. 
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3.4.1 Longevity Derivatives 


A longevity derivative provides payoffs that are potentially attractive to insurance com- 
panies when they are concerned about their longevity exposure on annuity contracts 
and to pension funds. A typical contract is a longevity bond, also known as a survivor bond, 
which first traded in the late 1990s. A population group is defined and the coupon on 
the bond at any given time is defined as being proportional to the number of individuals 
in the population who are still alive. 

Who will sell such bonds to insurance companies and pension funds? The answer is 
some speculators find the bonds attractive because they have very little systematic risk. 
(See Section 1.3 for a discussion of systematic risk.) The bond payments depend on how 
long people live and this is largely uncorrelated with returns from the market. 


3.5 Property-Casualty Insurance 


Property-casualty insurance can be subdivided into property insurance and casualty 
insurance. Property insurance provides protection against loss of or damage to property 
(from fire, theft, water damage, etc.). Casualty insurance provides protection against legal 
liability exposures (from, for example, injuries caused to third parties). Casualty insur- 
ance might more accurately be referred to as liability insurance. Sometimes both types of 
insurance are included in a single policy. For example, a home owner might buy insur- 
ance that provides protection against various types of loss such as property damage and 
theft as well as legal liabilities if others are injured while on the property. Similarly, car 
insurance typically provides protection against theft of, or damage to, one’s own vehicle 
as well as protection against claims brought by others. 

Typically, property-casualty policies are renewed from year to year and the insurance 
company will change the premium if its assessment of the expected payout changes. (This 
is different from life insurance, where premiums tend to remain the same for the life of 
the policy.) Because property-casualty insurance companies get involved in many differ- 
ent types of insurance, there is some natural risk diversification. Also, for some risks, the 
“law of large numbers” applies. For example, if an insurance company has written policies 
protecting 250,000 home owners against losses from theft and fire damage, the expected 
payout can be predicted reasonably accurately. This is because the policies provide pro- 
tection against a large number of (almost) independent events. (Of course, there are liable 
to be trends through time in the number of losses and size of losses, and the insurance 
company should keep track of these trends in determining year-to-year changes in the 
premiums.) 

Property damage arising from natural disasters such as hurricanes gives rise to payouts 
for an insurance company that are much less easy to predict. For example, Hurricane Kat- 
rina in the United States in the summer of 2005 and a heavy storm in northwest Europe 
in January 2007 that measured 12 on the Beaufort scale proved to be very expensive. 
These are termed catastrophic risks. The problem with them is that the claims made by 
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different policyholders are not independent. Either a hurricane happens in a year and 
the insurance company has to deal with a large number of claims for hurricane-related 
damage or there is no hurricane in the year and therefore no claims are made. Most 
large insurers have models based on geographical, seismographical, and meteorological 
information to estimate the probabilities of catastrophes and the losses resulting there- 
from. This provides a basis for setting premiums, but it does not alter the “all-or-nothing” 
nature of these risks for insurance companies. 

Liability insurance, like catastrophe insurance, gives rise to total payouts that vary 
from year to year and are difficult to predict. For example, claims arising from asbestos- 
related damage to workers’ health have proved very expensive for insurance companies 
in the United States. A feature of liability insurance is what is known as long-tail risk. 
This is the possibility of claims being made several years after the insured period is over. 
In the case of asbestos, for example, the health risks were not realized until some time 
after exposure. As a result, the claims, when they were made, were under policies that 
had been in force several years previously. This creates a complication for actuaries and 
accountants. They cannot close the books soon after the end of each year and calculate 
a profit or loss. They must allow for the cost of claims that have not yet been made, but 
may be made some time in the future. 


3.5.1 CAT Bonds 


The derivatives market has come up with a number of products for hedging catastrophic 
risk. The most popular is a catastrophe (CAT) bond. This is a bond issued by a subsidiary 
of an insurance company that pays a higher-than-normal interest rate. In exchange for 
the extra interest, the holder of the bond agrees to cover payouts on a particular type of 
catastrophic risk that are in a certain range. Depending on the terms of the CAT bond, 
the interest or principal (or both) can be used to meet claims. 

Suppose an insurance company has a $70 million exposure to California earthquake 
losses and wants protection for losses over $40 million. The insurance company could 
issue CAT bonds with a total principal of $30 million. In the event that the insurance 
company’s California earthquake losses exceeded $40 million, bondholders would lose 
some or all of their principal. As an alternative, the insurance company could cover the 
same losses by making a much bigger bond issue where only the bondholders’ interest 
is at risk. Yet another alternative is to make three separate bond issues covering losses in 
the range of $40 to $50 million, $50 to $60 million, and $60 to $70 million, respectively. 

CAT bonds typically give a high probability of an above-normal rate of interest 
and a low probability of a high loss. Why would investors be interested in such instru- 
ments? The answer is that the return on CAT bonds, like the longevity bonds considered 
earlier, has no statistically significant correlations with market returns.! CAT bonds are 


| See R. H. Litzenberger, D. R. Beaglehole, and C. E. Reynolds, “Assessing Catastrophe Reinsurance- 
Linked Securities as a New Asset Class,” Journal of Portfolio Management (Winter 1996): 76—86. 
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therefore an attractive addition to an investor’s portfolio. Their total risk can be com- 
pletely diversified away in a large portfolio. Ifa CAT bond’s expected return is greater 
than the risk-free interest rate (and typically it is), it has the potential to improve risk- 
return trade-offs. 


3.5.2 Ratios Calculated by Property-Casualty Insurers 


Insurance companies calculate a loss ratio for different types of insurance. This is the ratio 
of payouts made to premiums earned in a year. Loss ratios are typically in the 60% to 
80% range. Statistics published by A. M. Best show that loss ratios in the United States 
have tended to increase through time. The expense ratio for an insurance company is 
the ratio of expenses to premiums earned in a year. The two major sources of expenses 
are loss adjustment expenses and selling expenses. Loss adjustment expenses are those 
expenses related to determining the validity of a claim and how much the policyholder 
should be paid. Selling expenses include the commissions paid to brokers and other 
expenses concerned with the acquisition of business. Expense ratios in the United States 
are typically in the 25% to 30% range and have tended to decrease through time. 

The combined ratio is the sum of the loss ratio and the expense ratio. Suppose that for 
a particular category of policies in a particular year the loss ratio is 75% and the expense 
ratio is 30%. The combined ratio is then 105%. Sometimes a small dividend is paid to 
policyholders. Suppose that this is 1% of premiums. When this is taken into account 
we obtain what is referred to as the combined ratio after dividends. This is 106% in our 
example. This number suggests that the insurance company has lost 6% before tax on the 
policies being considered. In fact, this may not be the case. Premiums are generally paid by 
policyholders at the beginning of a year and payouts on claims are made during the year, 
or after the end of the year. The insurance company is therefore able to earn interest on 
the premiums during the time that elapses between the receipt of premiums and payouts. 
Suppose that, in our example, investment income is 9% of premiums received. When the 
investment income is taken into account, a ratio of 106 — 9 = 97% is obtained. This is 
referred to as the operating ratio. Table 3.2 summarizes this example. 


Table 3.2 Example Showing Calculation of 
Operating Ratio for a Property-Casualty 
Insurance Company 


Loss ratio 75% 
Expense ratio 30% 
Combined ratio 105% 
Dividends 1% 
Combined ratio after dividends 106% 
Investment income (9%) 


Operating ratio 97% 
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3.6 Health Insurance 


Health insurance has some of the attributes of property-casualty insurance and some of 
the attributes of life insurance. It is sometimes considered to be a totally separate category 
of insurance. The extent to which health care is provided by the government varies from 
country to country. In the United States publicly funded health care has traditionally 
been limited and health insurance has therefore been an important consideration for 
most people. Canada is at the other extreme: nearly all health care needs are provided by 
a publicly funded system. Doctors are not allowed to offer most services privately. The 
main role of health insurance in Canada is to cover prescription costs and dental care, 
which are not funded publicly. In most other countries, there is a mixture of public and 
private health care. The United Kingdom, for example, has a publicly funded health care 
system, but some individuals buy insurance to have access to a private system that operates 
side by side with the public system. (The main advantage of private health insurance is a 
reduction in waiting times for routine elective surgery.) 

In 2010, President Obama signed into law the Patient Protection and Affordable Care 
Act in an attempt to reform health care in the United States and increase the number of 
people with medical coverage. The eligibility for Medicaid (a program for low-income 
individuals) was expanded and subsidies were provided for low- and middle-income 
families to help them buy insurance. The act prevents health insurers from taking pre- 
existing medical conditions into account and requires employers to provide coverage to 
their employees or pay additional taxes. One difference between the United States and 
many other countries continues to be that health insurance is largely provided by the 
private rather than the public sector. 

In health insurance, as in other forms of insurance, the policyholder makes regular 
premium payments and payouts are triggered by events. Examples of such events are 
the policyholder needing an examination by a doctor, the policyholder requiring treat- 
ment at a hospital, and the policyholder requiring prescription medication. Typically the 
premiums increase because of overall increases in the costs of providing health care. How- 
ever, they usually cannot increase because the health of the policyholder deteriorates. It 
is interesting to compare health insurance with auto insurance and life insurance in this 
respect. An auto insurance premium can increase (and usually does) if the policyholder’s 
driving record indicates that expected payouts have increased and if the costs of repairs to 
automobiles have increased. Life insurance premiums do not increase—even if the pol- 
icyholder is diagnosed with a health problem that significantly reduces life expectancy. 
Health insurance premiums are like life insurance premiums in that changes to the insur- 
ance company’s assessment of the risk of a payout do not lead to an increase in premiums. 
However, it is like auto insurance in that increases in the overall costs of meeting claims 
do lead to premium increases. 

Of course, when a policy is first issued, an insurance company does its best to 
determine the risks it is taking on. In the case of life insurance, questions concern- 
ing the policyholders health have to be answered, pre-existing medical conditions 
have to be declared, and physical examinations may be required. In the case of auto 
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insurance, the policyholder’s driving record is investigated. In both of these cases, insur- 
ance can be refused. In the case of health insurance, legislation sometimes determines 
the circumstances under which insurance can be refused. As indicated earlier, the 
Patient Protection and Affordable Health Care Act makes it very difficult for insur- 
ance companies in the United States to refuse coverage because of pre-existing medical 
conditions. 

Health insurance is often provided by the group health insurance plans of employers. 
These plans typically cover the employee and the employee’s family. The cost of the 
health insurance is sometimes split between the employer and employee. The expenses 
that are covered vary from plan to plan. In the United States, most plans cover basic 
medical needs such as medical check-ups, physicals, treatments for common disorders, 
surgery, and hospital stays. Pregnancy costs may or may not be covered. Procedures such 
as cosmetic surgery are usually not covered. 


3.7 Moral Hazard and Adverse Selection 


We now consider two key risks facing insurance companies: moral hazard and adverse 
selection. 


3.7.1 Moral Hazard 


Moral hazard is the risk that the existence of insurance will cause the policyholder to 
behave differently than he or she would without the insurance. This different behavior 
increases the risks and the expected payouts of the insurance company. Three examples 
of moral hazard are: 


1. A car owner buys insurance to protect against the car being stolen. As a result of the 
insurance, he or she becomes less likely to lock the car. 

2. An individual purchases health insurance. As a result of the existence of the policy, 
more health care is demanded than previously. 

3. As a result of a government-sponsored deposit insurance plan, a bank takes more 
risks because it knows that it is less likely to lose depositors because of this strategy. 
(This was discussed in Section 2.3.) 


Moral hazard is not a big problem in life insurance. Insurance companies have tradi- 
tionally dealt with moral hazard in property-casualty and health insurance in a number 
of ways. Typically there is a deductible. This means that the policyholder is responsible for 
bearing the first part of any loss. Sometimes there is a co-insurance provision in a policy. The 
insurance company then pays a predetermined percentage (less than 100%) of losses in 
excess of the deductible. In addition there is nearly always a policy limit (1.e.,an upper limit 
to the payout). The effect of these provisions is to align the interests of the policyholder 
more closely with those of the insurance company. 
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3.7.2 Adverse Selection 


Adverse selection is the phrase used to describe the problems an insurance company has 
when it cannot distinguish between good and bad risks. It offers the same price to every- 
one and inadvertently attracts more of the bad risks. If an insurance company is not able 
to distinguish good drivers from bad drivers and offers the same auto insurance premium 
to both, it is likely to attract more bad drivers. If it is not able to distinguish healthy from 
unhealthy people and offers the same life insurance premiums to both, it is likely to 
attract more unhealthy people. 

To lessen the impact of adverse selection, an insurance company tries to find out as 
much as possible about the policyholder before committing itself. Before offering life 
insurance, it often requires the policyholder to undergo a physical examination by an 
approved doctor. Before offering auto insurance to an individual, it will try to obtain as 
much information as possible about the individual’s driving record. In the case of auto 
insurance, it will continue to collect information on the driver’s risk (number of accidents, 
number of speeding tickets, etc.) and make year-to-year changes to the premium to reflect 
this. 

Adverse selection can never be completely overcome. It is interesting that, in spite of 
the physical examinations that are required, individuals buying life insurance tend to die 
earlier than mortality tables would suggest. But individuals who purchase annuities tend 
to live longer than mortality tables would suggest. 


3.8 Reinsurance 


Reinsurance is an important way in which an insurance company can protect itself against 
large losses by entering into contracts with another insurance company. For a fee, the 
second insurance company agrees to be responsible for some of the risks that have been 
insured by the first company. Reinsurance allows insurance companies to write more 
policies than they would otherwise be able to. Some of the counterparties in reinsurance 
contracts are other insurance companies or rich private individuals; others are companies 
that specialize in reinsurance such as Swiss Re and Warren Buffett’s company, Berkshire 
Hathaway. 

Reinsurance contracts can take a number of forms. Suppose that an insurance com- 
pany has an exposure of $100 million to hurricanes in Florida and wants to limit this 
to $50 million. One alternative is to enter into annual reinsurance contracts that cover 
on a pro rata basis 50% of its exposure. (The reinsurer would then probably receive 50% 
of the premiums.) If hurricane claims in a particular year total $70 million, the costs to 
the insurance company would be only 0.5 x $70 or $35 million, and the reinsurance 
company would pay the other $35 million. 

Another, more popular alternative, involving lower reinsurance premiums, is to buy 
a series of reinsurance contracts covering what are known as excess cost layers. The first 
layer might provide indemnification for losses between $50 million and $60 million, the 
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Table 3.3 Abbreviated Balance Sheet for Life Insurance Company 


Assets Liabilities and Net Worth 
Investments 90 Policy reserves 80 
Other assets 10 Subordinated long-term debt 10 

Equity capital 10 
Total 100 Total 100 


next layer might cover losses between $60 million and $70 million, and so on. Each 
reinsurance contract is known as an excess-of-loss reinsurance contract. 


3.9 Capital Requirements 


The balance sheets for life insurance and property-casualty insurance companies are dif- 
ferent because the risks taken and the reserves that must be set aside for future payouts 
are different. 


3.9.1 Life Insurance Companies 


Table 3.3 shows an abbreviated balance sheet for a life insurance company. Most of the 
life insurance company’s investments are in corporate bonds. The insurance company 
tries to match the maturity of its assets with the maturity of liabilities. However, it takes 
on credit risk because the default rate on the bonds may be higher than expected. 

Unlike a bank, an insurance company has exposure on the liability side of the balance 
sheet as well as on the asset side. The policy reserves (80% of assets in this case) are 
estimates (usually conservative) of actuaries for the present value of payouts on the policies 
that have been written. The estimates may prove to be low if the holders of life insurance 
policies die earlier than expected or the holders of annuity contracts live longer than 
expected. The 10% equity on the balance sheet includes the original equity contributed 
and retained earnings and provides a cushion. If payouts are greater than loss reserves by 
an amount equal to 5% of assets, equity will decline, but the life insurance company will 
survive. 


3.9.2 Property-Casualty Insurance Companies 


Table 3.4 shows an abbreviated balance sheet for a property-casualty life insurance com- 
pany. A key difference between Table 3.3 and Table 3.4 is that the equity in Table 3.4 is 
much higher. This reflects the differences in the risks taken by the two sorts of insurance 
companies. The payouts for a property-casualty company are much less easy to predict 
than those for a life insurance company. Who knows when a hurricane will hit Miami 
or how large payouts will be for the next asbestos-like liability problem? The unearned 
premiums item on the liability side represents premiums that have been received, but 
apply to future time periods. If a policyholder pays $2,500 for house insurance on 
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Table 3.4 Abbreviated Balance Sheet for Property-Casualty 
Insurance Company 


Assets Liabilities and Net Worth 
Investments 90 Policy reserves 45 
Other assets 10 Unearned premiums 15 

Subordinated long-term debt 10 
Equity capital 30 
Total 100 Total 100 


June 30 of a year, only $1,250 has been earned by December 31 of the year. The invest- 
ments in Table 3.4 consist largely of liquid bonds with shorter maturities than the bonds 
in Table 3.3. 


3.10 The Risks Facing Insurance Companies 


The most obvious risk for an insurance company is that the policy reserves are not 
sufficient to meet the claims of policyholders. Although the calculations of actuaries 
are usually fairly conservative, there is always the chance that payouts much higher than 
anticipated will be required. Insurance companies also face risks concerned with the 
performance of their investments. Many of these investments are in corporate bonds. If 
defaults on corporate bonds are above average, the profitability of the insurance company 
will suffer. It is important that an insurance company’s bond portfolio be diversified by 
business sector and geographical region. An insurance company also needs to monitor 
the liquidity risks associated with its investments. Illiquid bonds (e.g., those the insurance 
company might buy in a private placement) tend to provide higher yields than bonds 
that are publicly owned and actively traded. However, they cannot be as readily converted 
into cash to meet unexpectedly high claims. Insurance companies enter into transactions 
with banks and reinsurance companies. This exposes them to credit risk. Like banks, 
insurance companies are also exposed to operational risks and business risks. 

Regulators specify minimum capital requirements for an insurance company to pro- 
vide a cushion against losses. Insurance companies, like banks, have also developed their 
own procedures for calculating economic capital. This is their own internal estimate of 
required capital (see Chapter 26). 


3.11 Regulation 


The ways in which insurance companies are regulated in the United States and Europe 
are quite different. 


3.11.1 United States 


In the United States, the McCarran-Ferguson Act of 1945 confirmed that insurance 
companies are regulated at the state level rather than the federal level. (Banks, by 
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contrast, are regulated at the federal level.) State regulators are concerned with the sol- 
vency of insurance companies and their ability to satisfy policyholders’ claims. They are 
also concerned with business conduct (i.e., how premiums are set, advertising, contract 
terms, the licensing of insurance agents and brokers, and so on). 

The National Association of Insurance Commissioners (NAIC) is an organization 
consisting of the chief insurance regulatory officials from all 50 states. It provides a 
national forum for insurance regulators to discuss common issues and interests. It also 
provides some services to state regulatory commissions. For example, it provides statistics 
on the loss ratios of property-casualty insurers. This helps state regulators identify those 
insurers for which the ratios are outside normal ranges. 

Insurance companies are required to file detailed annual financial statements with 
state regulators, and the state regulators conduct periodic on-site reviews. Capital require- 
ments are determined by regulators using risk-based capital standards determined by 
NAIC. These capital levels reflect the risk that policy reserves are inadequate, that 
counterparties in transactions default, and that the return on investments are less than 
expected. 

The policyholder is protected against an insurance company becoming insolvent 
(and therefore unable to make payouts on claims) by insurance guaranty associations. An 
insurer is required to be a member of the guaranty association in a state as a condition of 
being licensed to conduct business in the state. When there is an insolvency by another 
insurance company operating in the state, each insurance company operating in the state 
has to contribute an amount to the state guaranty fund that is dependent on the premium 
income it collects in the state. The fund is used to pay the small policyholders of the 
insolvent insurance company. (The definition of a small policyholder varies from state to 
state.) There may be a cap on the amount the insurance company has to contribute to 
the state guaranty fund in a year. This can lead to the policyholder having to wait several 
years before the guaranty fund is in a position to make a full payout on its claims. In the 
case of life insurance, where policies last for many years, the policyholders of insolvent 
companies are usually taken over by other insurance companies. However, there may be 
some change to the terms of the policy so that the policyholder is somewhat worse off 
than before. 

The guaranty system for insurance companies in the United States is therefore dif- 
ferent from that for banks. In the case of banks, there is a permanent fund created from 
premiums paid by banks to the FDIC to protect depositors. In the case of insurance com- 
panies, there is no permanent fund. Insurance companies have to make contributions after 
an insolvency has occurred. An exception to this is property-casualty companies in New 
York State, where a permanent fund does exist. 

Regulating insurance companies at the state level is unsatisfactory in some respects. 
Regulations are not uniform across the different states. A large insurance company that 
operates throughout the United States has to deal with a large number of different regu- 
latory authorities. Some insurance companies trade derivatives in the same way as banks, 
but are not subject to the same regulations as banks. This can create problems. In 2008, it 
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transpired that a large insurance company, American International Group (AIG), incurred 
huge losses trading credit derivatives and had to be bailed out by the federal government. 

The Dodd—Frank Act of 2010 set up the Federal Insurance Office (FIO), which is 
housed in the Department of the Treasury. It is tasked with monitoring the insurance 
industry and identifying gaps in regulation. It can recommend to the Financial Stabil- 
ity Oversight Council that a large insurance company (such as AIG) be designated as a 
nonbank financial company supervised by the Federal Reserve. It also liaises with regu- 
lators in other parts of the world (particularly, those in the European Union) to foster the 
convergence of regulatory standards. The Dodd—Frank Act required the FIO to “con- 
duct a study and submit a report to Congress on how to modernize and improve the 
system of insurance regulation in the United States.” The FIO submitted its report in 
December 2013 It identified changes necessary to improve the U.S. system of insur- 
ance regulation. It may be the case that the United States will move to a federal system for 
insurance regulation. However, it is also possible that President Donald Trump’s changes 
to Dodd-Frank will abolish the FIO, leaving regulation almost entirely in the hands of the 
states. 


3.11.2 Europe 


In the European Union, insurance companies are regulated centrally. This means that 
in theory the same regulatory framework applies to insurance companies throughout 
all member countries. The original framework (established in the 1970s) is known as 
Solvency I. It was heavily influenced by research carried out by Professor Cornelis 
Campagne from the Netherlands who showed that, with a capital equal to 4% of policy 
provisions, life insurance companies have a 95% chance of surviving. Investment risks are 
not explicitly considered by Solvency I. 

A number of countries, such as the UK, the Netherlands, and Switzerland, have 
developed their own plans to overcome some of the weaknesses in Solvency I. Solvency 
II, which assigns capital for a wider set of risks than Solvency I, is being implemented in 
the European Union during a period starting in 2016. Both Solvency I and Solvency II 
are discussed further in Chapter 15. 


3.12 Pension Plans 


Pension plans are set up by companies for their employees. Typically, contributions are 
made to a pension plan by both the employee and the employer while the employee is 
working. When the employee retires, he or she receives a pension until death. A pension 
fund therefore involves the creation of a lifetime annuity from regular contributions and 


?See “How to Modernize and Improve the System of Insurance Regulation in the United States,” 
Federal Insurance Office, December 2013. 
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has similarities to some of the products offered by life insurance companies. There are 
two types of pension plans: defined benefit and defined contribution. 

In a defined benefit plan, the pension that the employee will receive on retirement is 
defined by the plan. Typically it is calculated by a formula that is based on the number of 
years of employment and the employee’s salary. For example, the pension per year might 
equal the employee’s average earnings per year during the last three years of employment 
multiplied by the number of years of employment multiplied by 2%. The employee’s 
spouse may continue to receive a (usually reduced) pension if the employee dies before 
the spouse. In the event of the employee’s death while still employed, a lump sum is often 
payable to dependents and a monthly income may be payable to a spouse or dependent 
children. Sometimes pensions are adjusted for inflation. This is known as indexation. For 
example, the indexation in a defined benefit plan might lead to pensions being increased 
each year by 75% of the increase in the consumer price index. Pension plans that are 
sponsored by governments (such as Social Security in the United States) are similar to 
defined benefit plans in that they require regular contributions up to a certain age and 
then provide lifetime pensions. 

In a defined contribution plan the employer and employee contributions are invested on 
behalf of the employee. When employees retire, there are typically a number of options 
open to them. The amount to which the contributions have grown can be converted to 
a lifetime annuity. In some cases, the employee can opt to receive a lump sum instead of 
an annuity. 

The key difference between a defined contribution and a defined benefit plan is 
that, in the former, the funds are identified with individual employees. An account is 
set up for each employee and the pension is calculated only from the funds contributed 
to that account. By contrast, in a defined benefit plan, all contributions are pooled and 
payments to retirees are made out of the pool. In the United States, a 401(k) plan is 
a form of defined contribution plan where the employee elects to have some portion 
of his or her income directed to the plan (with possibly some employer matching) and 
can choose between a number of investment alternatives (e.g., stocks, bonds, and money 
market instruments). 

An important aspect of both defined benefit and defined contribution plans is the 
deferral of taxes. No taxes are payable on money contributed to the plan by the employee 
and contributions by a company are deductible. Taxes are payable only when pension 
income is received (and at this time the employee may have a relatively low marginal tax 
rate). 

Defined contribution plans involve very little risk for employers. If the performance 
of the plan’s investments is less than anticipated, the employee bears the cost. By contrast, 
defined benefit plans impose significant risks on employers because they are ultimately 
responsible for paying the promised benefits. Let us suppose that the assets of a defined 
benefit plan total $100 million and that actuaries calculate the present value of the obli- 
gations to be $120 million. The plan is $20 million underfunded and the employer is 
required to make up the shortfall (usually over a number of years). The risks posed by 
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BUSINESS SNAPSHOT 3.2 
A Perfect Storm 


During the period from December 31, 1999, to December 31, 2002, the S&P 
500 declined by about 40% from 1469.25 to 879.82, and 20-year Treasury rate 
in the United States declined by 200 basis points from 6.83% to 4.83%. The 
impact of the first of these events was that the market value of the assets of 
defined benefit pension plans declined sharply. The impact of the second of the 
two events was that the discount rate used by defined benefit plans for their 
liabilities decreased so that the fair value of the liabilities calculated by actuar- 
ies increased. This created a “perfect storm” for the pension plans. Many funds 
that had been overfunded became underfunded. Funds that had been slightly 
underfunded became much more seriously underfunded. 

When a company has a defined benefit plan, the value of its equity is adjusted 
to reflect the amount by which the plan is overfunded or underfunded. It is not 
surprising that many companies have tried to replace defined benefit pension 
plans with defined contribution plans to avoid the risk of equity being eroded 
by a perfect storm. 


defined benefit plans have led some companies to convert defined benefit plans to defined 
contribution plans. 

Estimating the present value of the liabilities in defined benefit plans is not easy. An 
important issue is the discount rate used. The higher the discount rate, the lower the 
present value of the pension plan liabilities. It used to be common to use the average rate 
of return on the assets of the pension plan as the discount rate. This encourages the pen- 
sion plan to invest in equities because the average return on equities is higher than the 
average return on bonds, making the value of the liabilities look low. Accounting stan- 
dards now recognize that the liabilities of pension plans are obligations similar to bonds 
and require the liabilities of the pension plans of private companies to be discounted 
at AA-rated bond yields. The difference between the value of the assets of a defined 
benefit plan and that of its liabilities must be recorded as an asset or liability on the bal- 
ance sheet of the company. Thus, if a company’s defined benefit plan is underfunded, 
the company’s shareholder equity is reduced. A perfect storm is created when the assets 
of a defined benefit pension plan decline sharply in value and the discount rate for its 
liabilities decreases sharply (see Business Snapshot 3.2). 


3.12.1 Are Defined Benefit Plans Viable? 


A typical defined benefit plan provides the employee with about 70% of final salary as a 
pension and includes some indexation for inflation. What percentage of the employee’s 
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income during his or her working life should be set aside for providing the pension? The 
answer depends on assumptions about interest rates, how fast the employee’s income rises 
during the employee’s working life, and so on. But, if an insurance company were asked 
to provide a quote for the sort of defined benefit plan we are considering, the required 
contribution rate would be about 25% of income each year. (Problems 3.15 and 3.19 
provide an indication of calculations that can be carried out.) The insurance company 
would invest the premiums in corporate bonds (in the same way that it does the premiums 
for life insurance and annuity contracts) because this provides the best way of matching 
the investment income with the payouts. 

The contributions to defined benefit plans (employer plus employee) are much less 
than 25% of income. In a typical defined benefit plan, the employer and employee each 
contribute around 5%. The total contribution is therefore only 40% of what an insurance 
actuary would calculate the required premium to be. It is therefore not surprising that 
many pension plans are underfunded. 

Unlike insurance companies, pension funds choose to invest a significant proportion 
of their assets in equities. (A typical portfolio mix for a pension plan is 60% equity and 
40% debt.) By investing in equities, the pension fund is creating a situation where there 
is some chance that the pension plan will be fully funded. But there is also some chance 
of severe underfunding. If equity markets do well, as they have done from 1960 to 2000 
in many parts of the world, defined benefit plans find they can afford their liabilities. But 
if equity markets perform badly, there are likely to be problems. 

This raises an interesting question: Who is responsible for underfunding in defined 
benefit plans? In the first instance, it is the company’s shareholders who bear the cost. If 
the company declares bankruptcy, the cost may be borne by the government via insurance 
that is offered? In either case there is a transfer of wealth to retirees from the next 
generation. 

Many people argue that wealth transfers from one generation to another are not 
acceptable. A 25% contribution rate to pension plans is probably not feasible. If defined 
benefit plans are to continue, there must be modifications to the terms of the plans 
so that there is some risk sharing between retirees and the next generation. If equity 
markets perform badly during their working life, retirees must be prepared to accept a 
lower pension and receive only modest help from the next generation. If equity markets 
perform well, retirees can receive a full pension and some of the benefits can be passed 
on to the next generation. 

Longevity risk is a major concern for pension plans. We mentioned earlier that life 
expectancy increased by about 20 years between 1929 and 2013. If this trend continues 
and life expectancy increases by a further five years by 2029, the underfunding problems 
of defined benefit plans (both those administered by companies and those administered 
by national governments) will become more severe. It is not surprising that, in many 


3 For example, in the United States, the Pension Benefit Guaranty Corporation (PBGC) insures private 
defined benefit plans. If the premiums the PBGC receives from plans are not sufficient to meet claims, 
presumably the government would have to step in. 
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jurisdictions, individuals have the right to work past the normal retirement age. Depend- 
ing on the rules governing pensions in a particular jurisdiction, this can help solve the 
problems faced by defined benefit pension plans. An individual who retires at 70 rather 
than 65 makes an extra five years of pension contributions and the period of time for 
which the pension is received is shorter by five years. 


Summary 


There are two main types of insurance companies: life and property-casualty. Life insur- 
ance companies offer a number of products that provide a payoff when the policyholder 
dies. Term life insurance provides a payoff only if the policyholder dies during a certain 
period. Whole life insurance provides a payoff on the death of the insured, regardless of 
when this is. There is a savings element to whole life insurance. Typically, the portion of 
the premium not required to meet expected payouts in the early years of the policy is 
invested, and this is used to finance expected payouts in later years. Whole life insurance 
policies usually give rise to tax benefits, because the present value of the tax paid is less 
than it would be if the investor had chosen to invest funds directly rather than through 
the insurance policy. 

Life insurance companies also offer annuity contracts. These are contracts that, in 
return for a lump-sum payment, provide the policyholder with an annual income from a 
certain date for the rest of his or her life. Mortality tables provide important information 
for the valuation of the life insurance contracts and annuities. However, actuaries must 
consider (a) longevity risk (the possibility that people will live longer than expected) and 
(b) mortality risk (the possibility that epidemics such as AIDS or Spanish flu will reduce 
life expectancy for some segments of the population). 

Property-casualty insurance is concerned with providing protection against a loss of, 
or damage to, property. It also protects individuals and companies from legal liabilities. 
The most difficult payouts to predict are those where the same event is liable to trig- 
ger claims by many policyholders at about the same time. Examples of such events are 
hurricanes or earthquakes. 

Health insurance has some of the features of life insurance and some of the features of 
property-casualty insurance. Health insurance premiums are like life insurance premiums 
in that changes to the company’s assessment of the risk of payouts do not lead to an 
increase in premiums. However, it is like property-casualty insurance in that increases in 
the overall costs of providing health care can lead to increases in premiums. 

Two key risks in insurance are moral hazard and adverse selection. Moral hazard is 
the risk that the behavior of an individual or corporation with an insurance contract 
will be different from the behavior without the insurance contract. Adverse selection is 
the risk that the individuals and companies who buy a certain type of policy are those 
for which expected payouts are relatively high. Insurance companies take steps to reduce 
these two types of risk, but they cannot eliminate them altogether. 

Insurance companies are different from banks in that their liabilities as well as their 
assets are subject to risk. A property-casualty insurance company must typically keep 
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more equity capital, as a percent of total assets, than a life insurance company. In the 
United States, insurance companies are different from banks in that they are regulated 
at the state level rather than at the federal level. In Europe, insurance companies are 
regulated by the European Union and by national governments. The European Union 
has developed a new set of capital requirements, known as Solvency II. 

There are two types of pension plans: defined benefit plans and defined contribu- 
tion plans. Defined contribution plans are straightforward. Contributions made by an 
employee and contributions made by the company on behalf of the employee are kept 
in a separate account, invested on behalf of the employee, and converted into a life- 
time annuity when the employee retires. In a defined benefit plan, contributions from all 
employees and the company are pooled and invested. Retirees receive a pension that is 
based on the salary they earned while working. The viability of defined benefit plans is 
questionable. Many are underfunded and need superior returns from equity markets to 
pay promised pensions to both current and future retirees. 
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Practice Questions and Problems 
(Answers at End of Book) 


3.1 What is the difference between term life insurance and whole life insurance? 

3.2 Explain the meaning of variable life insurance and universal life insurance. 

3.3 Alife insurance company offers whole life and annuity contracts. In which contracts 
does it have exposure to (a) longevity risk, (b) mortality risk? 

3.4 “Equitable Life gave its policyholders a free option.’ Explain the nature of the 
option. 

3.5 Use Table 3.1 to calculate the minimum premium an insurance company should 
charge for a $1 million two-year term life insurance policy issued to a woman age 
50. Assume that the premium is paid at the beginning of each year and that the 
interest rate is zero. 
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3.12 


3.14 


3.15 


3.16 


3:17 


From Table 3.1, what is the probability that a man age 30 will live to 90? What is 

the same probability for a woman age 30? 

What features of the policies written by a property-casualty insurance company 

give rise to the most risk? 

Explain how CAT bonds work. 

Consider two bonds that have the same coupon, time to maturity, and price. One is 

a B-rated corporate bond. The other is a CAT bond. An analysis based on historical 

data shows that the expected losses on the two bonds in each year of their life is 

the same. Which bond would you advise a portfolio manager to buy, and why? 

How does health insurance in the United States differ from that in Canada and the 

United Kingdom? 

An insurance company decides to offer individuals insurance against losing their 

jobs. What problems is it likely to encounter? 

Why do property-casualty insurance companies hold more capital than life insur- 

ance companies? 

Explain what is meant by “loss ratio” and “expense ratio” for a property-casualty 

insurance company. “If an insurance company is profitable, it must be the case that 

the loss ratio plus the expense ratio is less than 100%.” Discuss this statement. 

What is the difference between a defined benefit and a defined contribution pen- 

sion plan? 

Suppose that in a certain defined benefit pension plan 

(a) Employees work for 40 years earning wages that increase with inflation. 

(b) They retire with a pension equal to 75% of their final salary. This pension also 
increases with inflation. 

(c) The pension is received for 20 years. 

(d) The pension fund’s income is invested in bonds that earn the inflation rate. 

Estimate the percentage of an employee’s salary that must be contributed to the 

pension plan if it is to remain solvent. (Hint: Do all calculations in real rather than 

nominal dollars.) 


Further Questions 


Use Table 3.1 to calculate the minimum premium an insurance company should 
charge for a $5 million three-year term life insurance contract issued to a man age 
60. Assume that the premium is paid at the beginning of each year and death always 
takes place halfway through a year. The risk-free interest rate is 6% per annum (with 
semiannual compounding). 

An insurance company’s losses of a particular type per year are to a reasonable 
approximation normally distributed with a mean of $150 million and a standard 
deviation of $50 million. (Assume that the risks taken by the insurance company are 
entirely nonsystematic.) The one-year risk-free rate is 5% per annum with annual 
compounding. Estimate the cost of the following: 
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3.18 


3.19 


(a) A contract that will pay in one-year’s time 60% of the insurance company’s costs 
on a pro rata basis. 

(b) A contract that pays $100 million in one-year’s time if losses exceed $200 
million. 

During a certain year, interest rates fall by 200 basis points (2%) and equity prices 

are flat. Discuss the effect of this on a defined benefit pension plan that is 60% 

invested in equities and 40% invested in bonds. 

Suppose that in a certain defined benefit pension plan 

(a) Employees work for 45 years earning wages that increase at a real rate of 2%. 

(b) They retire with a pension equal to 70% of their final salary. This pension 
increases at the rate of inflation minus 1%. 

(c) The pension is received for 18 years. 

(d) The pension fund’s income is invested in bonds that earn the inflation rate plus 
1.5%. 

Estimate the percentage of an employee’s salary that must be contributed to the 

pension plan if it is to remain solvent. (Hint: Do all calculations in real rather than 

nominal dollars.) 


Chapter 4 


Mutual Funds, ETFs, and 
Hedge Funds 


utual funds, exchange-traded funds (ETFs), and hedge funds invest money 
M on behalf of individuals and companies. The funds from different investors 
are pooled and investments are chosen by the fund manager in an attempt to 
meet specified objectives. Mutual funds (which are called “unit trusts” in some countries) 
and ETFs serve the needs of relatively small investors, while hedge funds seek to attract 
funds from wealthy individuals and large investors such as pension funds. Hedge funds 
are subject to much less regulation than mutual funds and ETFs. They are free to use a 
wider range of trading strategies than mutual funds and are usually more secretive about 
what they do. Mutual funds and ETFs are required to explain their investment policies 
in a prospectus that is available to potential investors. 
This chapter describes the types of mutual funds, ETFs, and hedge funds that exist. It 
examines how they are regulated and the fees they charge. It also looks at how successful 
they have been at producing good returns for investors. 


4.1 Mutual Funds 


One of the attractions of mutual funds for the small investor is the diversification oppor- 
tunities they offer. As we saw in Chapter 1, diversification improves an investor’s risk- 
return trade-off. However, it can be difficult for a small investor to hold enough stocks to 
be well diversified. In addition, maintaining a well-diversified portfolio can lead to high 
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Table 4.1 Growth of Assets of Open-End Mutual 
Funds in the United States 


Year Assets ($ billions) 
1940 0.5 
1960 17.0 
1980 134.8 
2000 6,964.6 
2016 16,343.7 


Source: Investment Company Institute. 


transaction costs. A mutual fund provides a way in which the resources of many small 
investors are pooled so that the benefits of diversification are realized at a relatively low 
cost. 

An investor in a mutual fund owns a certain number of shares in the fund. The 
most common type of mutual fund is an open-end fund. This means that the total num- 
ber of shares outstanding goes up as investors buy more shares and down as shares are 
redeemed. Open-end mutual funds have grown very fast since the Second World War. 
Table 4.1 shows estimates of the assets managed by open-end mutual funds in the United 
States since 1940. These assets were over $16 trillion by 2016. About 44% of U.S. house- 
holds own mutual funds. Some mutual funds are offered by firms that specialize in asset 
management, such as Fidelity. Others are offered by banks such as JPMorgan Chase. 
Some insurance companies also offer mutual funds. For example, in 2001, the large US. 
insurance company State Farm began offering 10 mutual funds throughout the United 
States. They can be purchased over the Internet or by phone or through State Farm 
agents. 

Money market mutual funds invest in interest-bearing instruments, such as Treasury 
bills, commercial paper, and bankers’ acceptances, with a life of less than one year. They 
are an alternative to interest-bearing bank accounts and usually provide a higher rate 
of interest because they are not insured by a government agency. Some money market 
funds offer check-writing facilities similar to banks. Money market fund investors are 
typically risk-averse and do not expect to lose any of the funds invested. In other words, 
investors expect a positive return after management fees.' In normal market conditions 
this is what they get. But occasionally the return is negative so that some principal is 
lost. This is known as “breaking the buck” because a $1 investment is then worth less 
than $1. After Lehman Brothers defaulted in September 2008, the oldest money fund 
in the United States, Reserve Primary Fund, broke the buck because it had to write 
off short-term debt issued by Lehman. To avoid a run on money market funds (which 
would have meant that healthy companies had no buyers for their commercial paper), a 
government-backed guaranty program was introduced. It lasted for about a year. 


Í Stable value funds are a popular alternative to money market funds. They typically invest in bonds 
and similar instruments with lives of up to five years. Banks and other companies provide (for a price) 
insurance guaranteeing that the return will not be negative. 
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There are three main types of long-term funds: 


1. Bond funds that invest in fixed-income securities with a life of more than one year. 
2. Equity funds that invest in common and preferred stock. 
3. Hybrid funds that invest in stocks, bonds, and other securities. 


Equity mutual funds are by far the most popular. 

Mutual funds are valued at 4 p.m. each day. This involves the mutual fund manager 
calculating the market value of each asset in the portfolio so that the total value of the 
fund is determined. This total value is divided by the number of shares outstanding to 
obtain the value of each share. The latter is referred to as the net asset value (NAV) of the 
fund. Shares in the fund can be bought from the fund or sold back to the fund at any 
time. When an investor issues instructions to buy or sell shares, it is the next-calculated 
NAV that applies to the transaction. For example, if an investor decides to buy at 2 P.M. 
on a particular business day, the NAV at 4 P.M. on that day determines the amount paid 
by the investor. 

The investor usually pays tax as though he or she owned the securities in which the 
fund has invested. Thus, when the fund receives a dividend, an investor in the fund has 
to pay tax on the investor’s share of the dividend, even if the dividend is reinvested in 
the fund for the investor. When the fund sells securities, the investor is deemed to have 
realized an immediate capital gain or loss, even if the investor has not sold any of his or 
her shares in the fund. Suppose the investor buys shares at $100 and the trading by the 
fund leads to a capital gain of $20 per share in the first tax year and a capital loss of $25 
per share in the second tax year. The investor has to declare a capital gain of $20 in the 
first year and a loss of $25 in the second year. When the investor sells the shares, there is 
also a capital gain or loss. To avoid double counting, the purchase price of the shares is 
adjusted to reflect the capital gains and losses that have already accrued to the investor. 
Thus, if in our example the investor sold shares in the fund during the second year, the 
purchase price would be assumed to be $120 for the purpose of calculating capital gains 
or losses on the transaction during the second year; if the investor sold the shares in 
the fund during the third year, the purchase price would be assumed to be $95 for the 
purpose of calculating capital gains or losses on the transaction during the third year. 


4.1.1 Index Funds 


Some funds are designed to track a particular equity index, such as the S&P 500 or 
the FTSE 100. The tracking can most simply be achieved by buying all the shares in 
the index in amounts that reflect their weight. For example, if IBM has 1% weight in a 
particular index, 1% of the tracking portfolio for the index would be invested in IBM 
stock. Another way of achieving tracking is to choose a smaller portfolio of representative 
shares that has been shown by research to track the chosen portfolio closely. Yet another 
way is to use index futures. 

One of the first index funds was launched in the United States on December 31, 
1975, by John Bogle to track the S&P 500. It started with only $11 million of assets and 
was initially ridiculed as being “un-American” and “Bogle’s folly.’ However, it has been 
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hugely successful and has been renamed the Vanguard 500 Index Fund. Its assets were 
over $500 billion in 2017. 

How accurately do index funds track the index? Two relevant measures are the track- 
ing error and the expense ratio. The tracking error of a fund can be defined as either the 
root mean square error of the difference between the fund’s return per year and the index 
return per year or as the standard deviation of this difference” The expense ratio is the 
fee charged per year, as a percentage of assets, for administering the fund. 


4.1.2 Costs 


Mutual funds incur a number of different costs. These include management expenses, 
sales commissions, accounting and other administrative costs, transaction costs on trades, 
and so on. To recoup these costs, and to make a profit, fees are charged to investors. A 
front-end load is a fee charged when an investor first buys shares in a mutual fund. Not all 
funds charge this type of fee. Those that do are referred to as front-end loaded. In the 
United States, front-end loads are restricted to being less than 8.5% of the investment. 
Some funds charge fees when an investor sells shares. These are referred to as a back- 
end load. Typically the back-end load declines with the length of time the shares in the 
fund have been held. All funds charge an annual fee. There may be separate fees to cover 
management expenses, distribution costs, and so on. The total expense ratio is the total of 
the annual fees charged per share divided by the value of the share. 

Khorana et al. (2009) compared the mutual fund fees in 18 different countries? The 
researchers assume in their analysis that a fund is kept for five years. The total shareholder 
cost per year is calculated as 


Front-end load i Back-end load 
5 5 
Their results are summarized in Table 4.2. The average fees for equity funds vary from 


1.41% in Australia to 3.00% in Canada. Fees for equity funds are on average about 50% 
higher than for bond funds. Index funds tend to have lower fees than regular funds 


Total expense ratio + 


because no highly paid stock pickers or analysts are required. For some index funds in 
the United States, fees are as low as 0.15% per year. 


4.1.3 Closed-End Funds 


The funds we have talked about so far are open-end funds. These are by far the most 
common type of fund. The number of shares outstanding varies from day to day as 
individuals choose to invest in the fund or redeem their shares. Closed-end funds are like 
regular corporations and have a fixed number of shares outstanding. The shares of the 
fund are traded on a stock exchange. For closed-end funds, two NAVs can be calculated. 


? The root mean square error of the difference (square root of the average of the squared differences) 
is a better measure. The trouble with standard deviation is that it is low when the error is large but 
fairly constant. 

3 See A. Khorana, H. Servaes, and P. Tufano, “Mutual Fund Fees Around the World,” Review of Financial 
Studies 22 (March 2009): 1279—1310. 
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Table 4.2 Average Total Cost per Year When Mutual Fund Is 
Held for Five Years (% of assets) 


Country Bond Funds Equity Funds 
Australia 0.75 1.41 
Austria 1.55 2.37 
Belgium 1.60 2.27 
Canada 1.84 3.00 
Denmark 1.91 2.62 
Finland 1.76 2.77 
France 1.57 2.31 
Germany 1.48 2.29 
Italy 1.56 2.58 
Luxembourg 1.62 2.43 
Netherlands 1.73 2.46 
Norway 1.77 2.67 
Spain 1.58 2.70 
Sweden 1.67 2.47 
Switzerland 1.61 2.40 
United Kingdom 1.73 2.48 
United States 1.05 1.53 
Average 1.39 2.09 


Source: A. Khorana, H. Servaes, and P. Tufano, “Mutual Fund Fees Around the 
World,” Review of Financial Studies 22 (March 2009): 1279-1310. 


One is the price at which the shares of the fund are trading. The other is the market value 
of the fund’s portfolio divided by the number of shares outstanding. The latter can be 
referred to as the fair market value. Usually a closed-end fund’s share price is less than its 
fair market value. A number of researchers have investigated the reason for this. Research 
by Ross (2002) suggests that the fees paid to fund managers provide the explanation 4 
Closed-end funds are much less popular than open-end funds. In 2016, their assets in the 
United States totaled $262 billion. 


4.2 Exchange-Traded Funds 


Exchange-traded funds (ETFs) have existed in the United States since 1993 and in Europe 
since 1999. They often track an index and so are an alternative to an index mutual fund 
for investors who are comfortable earning a return that is designed to mirror the index. 
One of the most widely known ETFs, called the Spider, tracks the S&P 500 and trades 
under the symbol SPY. In a survey of investment professionals conducted in March 2008, 
67% called ETFs the most innovative investment vehicle of the previous two decades and 
60% reported that ETFs have fundamentally changed the way they construct investment 
portfolios. In 2008, the SEC in the United States authorized the creation of actively 
managed ETFs. 


+See S. Ross, “Neoclassical Finance, Alternative Finance and the Closed End Fund Puzzle,” European 
Financial Management 8 (2002): 129-137. 
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ETFs are created by institutional investors. Typically, an institutional investor deposits 
a block of securities with the ETF and obtains shares in the ETF (known as creation units) 
in return. Some or all of the shares in the ETF are then traded on a stock exchange. 
This gives ETFs the characteristics of a closed-end fund rather than an open-end fund. 
However, a key feature of ETFs is that institutional investors can exchange large blocks 
of shares in the ETF for the assets underlying the shares at that time. They can give up 
shares they hold in the ETF and receive the assets or they can deposit new assets and 
receive new shares. This ensures that there is never any appreciable difference between 
the price at which shares in the ETF are trading on the stock exchange and their fair 
market value. This is a key difference between ETFs and closed-end funds and makes 
ETFs more attractive to investors than closed-end funds. 

ETFs have a number of advantages over open-end mutual funds. ETFs can be bought 
or sold at any time of the day. They can be shorted in the same way that shares in any 
stock are shorted. (See Chapter 5 for a discussion of short selling.) ETF holdings are 
disclosed twice a day, giving investors full knowledge of the assets underlying the fund. 
Mutual funds by contrast only have to disclose their holdings relatively infrequently. 
When shares in a mutual fund are sold, managers often have to sell the stocks in which 
the fund has invested to raise the cash that is paid to the investor. When shares in the 
ETF are sold, this is not necessary as another investor is providing the cash. This means 
that transaction costs are saved and there are less unplanned capital gains and losses passed 
on to shareholders. Finally, the expense ratios of ETFs tend to be lower than those of 
mutual funds. The popularity of ETFs is increasing. By 2016, their assets had reached 
$2.5 trillion. 


4.3 Active vs. Passive Management 


When buying a mutual fund or an ETE an investor has a choice between a passively 
managed fund that is designed to track an index such as the S&P 500 and an actively 
managed fund that relies on the stock selection and timing skills of the fund manager. 
Actively managed funds tend to have much higher expense ratios. A key question there- 
fore is whether actively managed mutual funds outperform stock indices such as the S&P 
500. Some funds in some years do very well, but this could be the result of good luck 
rather than good investment management. Two key questions for researchers are: 


1. Do actively managed funds outperform stock indices on average? 
2. Do funds that outperform the market in one year continue to do so? 


The answer to both questions appears to be no. In a classic study, Jensen (1969) per- 


formed tests on mutual fund performance using 10 years of data on 115 funds? He 
calculated the alpha for each fund in each year. (As explained in Section 1.3, alpha is 


>See M. C. Jensen, “Risk, the Pricing of Capital Assets and the Evaluation of Investment Portfolios,” 
Journal of Business 42 (April 1969): 167-247. 
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Table 4.3 Consistency of Good Performance by Mutual Funds 


Number of Percentage of 
Consecutive Years Number of Observations When 
of Positive Alpha Observations Next Alpha Is Positive 
1 574 50.4 

2 312 52.0 

3 161 53.4 

4 79 55.8 

5 41 46.4 

6 17 35.3 


the return earned in excess of that predicted by the capital asset pricing model.) The 
average alpha was about zero before all expenses and negative after expenses were con- 
sidered. Jensen tested whether funds with positive alphas tended to continue to earn 
positive alphas. His results are summarized in Table 4.3. The first row shows that 574 
positive alphas were observed from the 1,150 observations (close to 50%). Of these posi- 
tive alphas, 50.4% were followed by another year of positive alpha. Row two shows that, 
when two years of positive alphas have been observed, there is a 52% chance that the 
next year will have a positive alpha, and so on. The results show that, when a manager has 
achieved above-average returns for one year (or several years in a row), there is still only 
a probability of about 50% of achieving above-average returns the next year. The results 
suggest that managers who obtain positive alphas do so because of luck rather than skill. 
It is possible that there are some managers who are able to perform consistently above 
average, but they are a very small percentage of the total. More recent studies have con- 
firmed Jensen’s conclusions. On average, mutual fund managers do not beat the market 
and past performance is not a good guide to future performance. The success of index 
funds shows that this research has influenced the views of many investors. 

Mutual funds frequently advertise impressive returns. However, the fund being fea- 
tured might be one fund out of many offered by the same organization that happens to 
have produced returns well above the average for the market. Distinguishing between 
good luck and good performance is always tricky. Suppose an asset management com- 
pany has 32 funds following different trading strategies and assume that the fund managers 
have no particular skills, so that the return of each fund has a 50% chance of being greater 
than the market each year. The probability of a particular fund beating the market every 
year for the next five years is (1/2)° or 1/32. This means that by chance one out of the 
32 funds will show a great performance over the five-year period! 

One point should be made about the way returns over several years are expressed. 
One mutual fund might advertise “The average of the returns per year that we have 
achieved over the last five years is 15%.’ Another might say “If you had invested your 
money in our mutual fund for the last five years your money would have grown at 
15% per year.” These statements sound the same, but are actually different, as illustrated 
by Business Snapshot 4.1. In many countries, regulators have strict rules to ensure that 
mutual fund returns are not reported in a misleading way. 
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BUSINESS SNAPSHOT 4.1 
Mutual Fund Returns Can Be Misleading 


Suppose that the following is a sequence of returns per annum reported 
by a mutual fund manager over the last five years (measured using annual 
compounding): 


15%, 20%, 30%, -20%, 25% 


The arithmetic mean of the returns, calculated by taking the sum of the returns 
and dividing by 5, is 14%. However, an investor would actually earn less than 
14% per annum by leaving the money invested in the fund for five years. The 
dollar value of $100 at the end of the five years would be 


100 x 1.15 x 1.20 x 1.30 x 0.80 x 1.25 = $179.40 
By contrast, a 14% return (with annual compounding) would give 
100 x 1.14° = $192.54 
The return that gives $179.40 at the end of five years is 12.4%. This is because 
100 x (1.124)? = 179.40 


What average return should the fund manager report? It is tempting for the 
manager to make a statement such as: “The average of the returns per year that 
we have realized in the last five years is 14%.” Although true, this is misleading. 
It is much less misleading to say: “The average return realized by someone who 
invested with us for the last five years is 12.4% per year.” In some jurisdictions, 
regulations require fund managers to report returns the second way. 

This phenomenon is an example of a result that is well known by mathe- 
maticians. The geometric mean of a set of numbers (not all the same) is always 
less than the arithmetic mean. In our example, the return multipliers each year 
are 1.15, 1.20, 1.30, 0.80, and 1.25. The arithmetic mean of these numbers is 
1.140, but the geometric mean is only 1.124. An investor who keeps an invest- 
ment for several years earns a return corresponding to the geometric mean, not 
the arithmetic mean. 


4.4 Regulation 


Because they solicit funds from small retail customers, many of whom are unsophisti- 
cated, mutual funds and ETFs are heavily regulated. The SEC is the primary regulator 
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of these funds in the United States. The funds must file a registration document with 
the SEC. Full and accurate financial information must be provided to prospective fund 
purchasers in a prospectus. There are rules to prevent conflicts of interest, fraud, and 
excessive fees. 

Despite the regulations, there have been a number of scandals involving mutual funds. 
One of these involves late trading. As mentioned earlier in this chapter, if a request to buy 
or sell mutual fund shares is placed by an investor with a broker by 4 P.M. on any given 
business day, it is the NAV of the fund at 4 p.m. that determines the price that is paid 
or received by the investor. In practice, for various reasons, an order to buy or sell is 
sometimes not passed from a broker to a mutual fund until later than 4 r.m. This allows 
brokers to collude with investors and submit new orders or change existing orders after 
4 pM. The NAV of the fund at 4 p.m. still applies to the investor—even though he or she 
may be using information on market movements (particularly movements in overseas 
markets) after 4 P.M. Late trading is not permitted under SEC regulations, and there were 
a number of prosecutions in the early 2000s that led to multimillion-dollar fine payments 
and employees being fired. 

Another scandal is known as market timing. This is a practice where favored clients 
are allowed to buy and sell mutual funds shares frequently (e.g., every few days) and 
in large quantities without penalty. One reason why they might want to do this is that 
they are indulging in the illegal practice of late trading. Another reason is that they are 
analyzing the impact of stocks whose prices have not been updated recently on the 
fund’s NAV. Suppose that the price of a stock has not been updated for several hours. 
(This could be because it does not trade frequently or because it trades on an exchange 
in a country in a different time zone.) If the U.S. market has gone up (down) in the 
last few hours, the calculated NAV is likely to understate (overstate) the value of the 
underlying portfolio and there is a short-term trading opportunity. Taking advantage of 
this is not necessarily illegal. However, it may be illegal for the mutual fund to offer special 
trading privileges to favored customers because the costs (such as those associated with 
providing the liquidity necessary to accommodate frequent redemptions) are borne by all 
customers. 

Other scandals have involved front running and directed brokerage. Front running occurs 
when a mutual fund is planning a big trade that is expected to move the market. It informs 
favored customers or partners before executing the trade, allowing them to trade for 
their own account first. Directed brokerage involves an improper arrangement between 
a mutual fund and a brokerage house where the brokerage house recommends the mutual 
fund to clients in return for receiving orders from the mutual fund for stock and bond 
trades. 


4.5 Hedge Funds 


Hedge funds are different from mutual funds in that they are subject to very little regula- 
tion. This is because they accept funds only from financially sophisticated individuals and 
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organizations. Examples of the regulations that affect mutual funds are the requirements 
that: 


e Shares be redeemable at any time 
e NAV be calculated daily 

e Investment policies be disclosed 
e The use of leverage be limited 


Hedge funds are largely free from these regulations. This gives them a great deal of 
freedom to develop sophisticated, unconventional, and proprietary investment strategies. 
Hedge funds are sometimes referred to as alternative investments. 

The first hedge fund, A. W. Jones & Co., was created by Alfred Winslow Jones in the 
United States in 1949. It was structured as a general partnership to avoid SEC regula- 
tions. Jones combined long positions in stocks considered to be undervalued with short 
positions in stocks considered to be overvalued. He used leverage to magnify returns. A 
performance fee equal to 20% of profits was charged to investors. The fund performed 
well and the term “hedge fund” was coined in a newspaper article written about A. W. 
Jones & Co. by Carol Loomis in 1966. The article showed that the fund’s performance 
after allowing for fees was better than the most successful mutual funds. Not surprisingly, 
the article led to a great deal of interest in hedge funds and their investment approach. 
Other hedge fund pioneers were George Soros, Walter J. Schloss, and Julian Robertson® 

“Hedge fund” implies that risks are being hedged. Jones’s trading strategy did involve 
hedging. He had little exposure to the overall direction of the market because his long 
position (in stocks considered to be undervalued) at any given time was about the same 
size as his short position (in stocks considered to be overvalued). However, for some 
hedge funds, the word “hedge” is inappropriate because they take aggressive bets on the 
future direction of the market with no particular hedging policy. 

Hedge funds have grown in popularity over the years, and it is estimated that more 
than $3 trillion was invested with them throughout the world in 2017. The success 
of hedge funds is a little surprising because, as we will see later, they have performed 
less well than the S&P 500 between 2009 and 2016. Many hedge funds are registered in 
tax-favorable jurisdictions such as the Cayman Islands. Funds of funds have been set up to 
allocate funds to different hedge funds. Hedge funds are difficult to ignore. They account 
for a large part of the daily turnover on the New York and London stock exchanges. 
They are major players in the convertible bond, credit default swap, distressed debt, and 
non-investment-grade bond markets. They are also active participants in the ETF market, 
often taking short positions. 


©The famous investor Warren Buffett can also be considered a hedge fund pioneer. In 1956, he started 
Buffett Partnership LP with seven limited partners and $100,100. Buffett charged his partners 25% of 
profits above a hurdle rate of 25%. He searched for unique situations, merger arbitrage, spin-offs, and 
distressed debt opportunities and earned an average of 29.5% per year. The partnership was disbanded 
in 1969, and Berkshire Hathaway (a holding company, not a hedge fund) was formed. 
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4.5.1 Fees 


One characteristic of hedge funds that distinguishes them from mutual funds is that fees 
are higher and dependent on performance. An annual management fee that is usually 
between 1% and 3% of assets under management is charged. This is designed to meet 
operating costs—but there may be an additional fee for things such as audits, account 
administration, and trader bonuses. Moreover, an incentive fee that is usually between 
15% and 30% of realized net profits (1.e., profits after management fees) is charged if the 
net profits are positive. This fee structure is designed to attract the most talented and 
sophisticated investment managers. Thus, a typical hedge fund fee schedule might be 
expressed as “2 plus 20%” indicating that the fund charges 2% per year of assets under 
management and 20% of net profit. On top of high fees there is usually a lock-up period 
of at least one year during which invested funds cannot be withdrawn. Some hedge 
funds with good track records have sometimes charged much more than the average. An 
example is James Simons’s Renaissance Technologies Corp., which has charged as much 
as “5 plus 44%.” (James Simons is a former math professor whose wealth was estimated 
at $18 billion in 2017.) 

The agreements offered by hedge funds may include clauses that make the incentive 
fees more palatable. For example: 


e There is sometimes a hurdle rate. This is the minimum return necessary for the incentive 
fee to be applicable. 


There is sometimes a high-water mark clause. This states that any previous losses must 
be recouped by new profits before an incentive fee applies. Because different investors 
place money with the fund at different times, the high-water mark is not necessarily 
the same for all investors. There may be a proportional adjustment clause stating that, if 
funds are withdrawn by investors, the amount of previous losses that has to be recouped 
is adjusted proportionally. Suppose a fund worth $200 million loses $40 million and 
$80 million of funds are withdrawn. The high-water mark clause on its own would 
require $40 million of profits on the remaining $80 million to be achieved before the 
incentive fee applied. The proportional adjustment clause would reduce this to $20 
million because the fund is only half as big as it was when the loss was incurred. 


There is sometimes a clawback clause that allows investors to apply part or all of previous 
incentive fees to current losses. A portion of the incentive fees paid by the investor each 
year is then retained in a recovery account. This account is used to compensate investors 
for a percentage of any future losses. 


Some hedge fund managers have become very rich from the generous fee schedules. 
Forbes estimates that the earnings of the top 25 hedge fund managers in 2016 totaled $10.9 
billion. It is estimated that two hedge fund managers earned over $1 billion: James Simons 
(Renaissance Technologies, $1.6 billion) and Ray Dalio (Bridgewater, $1.4 billion). 

If an investor has a portfolio of investments in hedge funds, the fees paid can be 
quite high. As a simple example, suppose that an investment is divided equally between 
two funds, A and B. Both funds charge 2 plus 20%. In the first year, Fund A earns 20% 
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while Fund B earns —10%. The investor’s average return on investment before fees is 
0.5 X 20% + 0.5 x (—10%) or 5%. The fees paid to fund A are 2% + 0.2 x (20 — 2)% or 
5.6%. The fees paid to Fund B are 2%. The average fee paid on the investment in the 
hedge funds is therefore 3.8%. The investor is left with a 1.2% return. This is half of what 
the investor would get if 2 plus 20% were applied to the overall 5% return. 

When a fund of funds is involved, there is an extra layer of fees and the investor’s 
return after fees is even worse. A typical fee charged by a fund of hedge funds used to 
be 1% of assets under management plus 10% of the net (after management and incentive 
fees) profits of the hedge funds they invest in. These fees have gone down as a result of 
poor hedge fund performance. Suppose a fund of hedge funds divides its money equally 
between 10 hedge funds. All charge 2 plus 20% and the fund of hedge funds charges 1 
plus 10%. It sounds as though the investor pays 3 plus 30%—but it can be much more 
than this. Suppose that five of the hedge funds lose 40% before fees and the other five 
make 40% before fees. An incentive fee of 20% of 38% or 7.6% has to be paid to each of 
the profitable hedge funds. The total incentive fee is therefore 3.8% of the funds invested. 
In addition there is a 2% annual fee paid to the hedge funds and 1% annual fee paid to 
the fund of funds. The investor’s net return is —6.8% of the amount invested. (This is 
6.8% less than the return on the underlying assets before fees.) 


4.5.2 Incentives of Hedge Fund Managers 


The fee structure gives hedge fund managers an incentive to make a profit. But it also 
encourages them to take risks. The hedge fund manager has a call option on the assets 
of the fund. As is well known, the value of a call option increases as the volatility of the 
underlying assets increases. This means that the hedge fund manager can increase the 
value of the option by taking risks that increase the volatility of the fund’s assets. The 
fund manager has a particular incentive to do this when nearing the end of the period 
over which the incentive fee is calculated and the return to date is low or negative. 
Suppose that a hedge fund manager is presented with an opportunity where there is 
a 0.4 probability of a 60% profit and a 0.6 probability of a 60% loss with the fees earned 
by the hedge fund manager being 2 plus 20%. The expected return on the investment is 


0.4 x 60% + 0.6 x (-60%) 


or —12%. 

Even though this is a terrible expected return, the hedge fund manager might be 
tempted to accept the investment. If the investment produces a 60% profit, the hedge 
fund’s fee is 2+ 0.2 X 58 or 13.6%. If the investment produces a 60% loss, the hedge 
fund’s fee is 2%. The expected fee to the hedge fund is therefore 


0.4 X 13.6 + 0.6 X 2 = 6.64 


or 6.64% of the funds under administration. The expected management fee is 2% and 
the expected incentive fee is 4.64%. 
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Table 4.4 Return from a High-Risk Investment 
Where Returns of +60% and —60% Have 
Probabilities of 0.4 and 0.6, Respectively, and the 
Hedge Fund Charges 2 Plus 20% 


Expected return to hedge fund 6.64% 
Expected return to investors — 18.64% 
Overall expected return —12.00% 


To the investors in the hedge fund, the expected return is 
0.4 x (60 — 0.2 x 58 — 2) + 0.6 x (—60 — 2) = —18.64 


or —18.64%. 

The example is summarized in Table 4.4. It shows that the fee structure of a hedge 
fund gives its managers an incentive to take high risks even when expected returns are 
negative. The incentives may be reduced by hurdle rates, high—water mark clauses, and 
clawback clauses. However, these clauses are not always as useful to investors as they 
sound. One reason is that investors have to continue to invest with the fund to take 
advantage of them. Another is that, as losses mount for a hedge fund, the hedge fund 
managers have an incentive to wind up the hedge fund and start a new one. 

The incentives we are talking about here are real. Imagine how you would feel as 
an investor in the hedge fund Amaranth. One of its traders, Brian Hunter, liked to make 
huge bets on the price of natural gas. Until 2006, his bets were largely right and as a result 
he was regarded as a star trader. His remuneration including bonuses is reputed to have 
been close to $100 million in 2005. During 2006, his bets proved wrong and Amaranth, 
which had about $9 billion of assets under administration, lost a massive $6.5 billion. (This 
was even more than the loss of hedge fund Long-Term Capital Management in 1998.) 
Brian Hunter did not have to return the bonuses he had previously earned. Instead, he 
left Amaranth and tried to start his own hedge fund. 

It is interesting to note that, in theory, two individuals can create a money machine as 
follows. One starts a hedge fund with a certain high risk (and secret) investment strategy. 
The other starts a hedge fund with an investment strategy that is the opposite of that 
followed by the first hedge fund. For example, if the first hedge fund decides to buy 
$1 million of silver, the second hedge fund shorts this amount of silver. At the time 
they start the funds, the two individuals enter into an agreement to share the incentive 
fees. One hedge fund (we do not know which one) is likely to do well and earn good 
incentive fees. The other will do badly and earn no incentive fees. Provided that they 
can find investors for their funds, they have a money machine! 


4.5.3 Prime Brokers 


Prime brokers are the banks that offer services to hedge funds. Typically a hedge fund, 
when it is first started, will choose a particular bank as its prime broker. This bank handles 
the hedge fund’s trades (which may be with the prime broker or with other financial 
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institutions), carries out calculations each day to determine the collateral the hedge fund 
has to provide, borrows securities for the hedge fund when it wants to take short posi- 
tions, provides cash management and portfolio reporting services, and makes loans to the 
hedge fund. In some cases, the prime broker provides risk management and consulting 
services and introduces the hedge fund to potential investors. The prime broker has a 
good understanding of the hedge fund’s portfolio and will typically carry out stress tests 
on the portfolio to decide how much leverage it is prepared to offer the fund. 

Although hedge funds are not heavily regulated, they do have to answer to their 
prime brokers. The prime broker is the main source of borrowed funds for a hedge fund. 
The prime broker monitors the risks being taken by the hedge fund and determines how 
much the hedge fund 1s allowed to borrow. Typically a hedge fund has to post securities 
with the prime broker as collateral for its loans. When it loses money, more collateral 
has to be posted. If it cannot post more collateral, it has no choice but to close out its 
trades. One thing the hedge fund has to think about is the possibility that it will enter 
into a trade that is correct in the long term, but loses money in the short term. Consider 
a hedge fund that thinks credit spreads are too high. It might be tempted to take a highly 
leveraged position where BBB-rated bonds are bought and Treasury bonds are shorted. 
However, there is the danger that credit spreads will increase before they decrease. In this 
case, the hedge fund might run out of collateral and be forced to close out its position 
at a huge loss. 

As a hedge fund gets larger, it is likely to use more than one prime broker. This means 
that no one bank sees all its trades and has a complete understanding of its portfolio. The 
opportunity of transacting business with more than one prime broker gives a hedge fund 
more negotiating clout to reduce the fees it pays. Goldman Sachs, Morgan Stanley, and 
many other large banks offer prime broker services to hedge funds and find them to be 
an important contributor to their profits.’ 


4.6 Hedge Fund Strategies 


In this section we will discuss the strategies followed by hedge funds. Our classification is 
similar to the one used by the Barclay Hedge Fund Indices, which provides indices track- 
ing hedge fund performance. Not all hedge funds can be classified in the way indicated. 
Some follow more than one of the strategies mentioned and some follow strategies that 
are not listed. (For example, there are funds specializing in weather derivatives.) 


4.6.1 Long/Short Equity 


As described earlier, long/short equity strategies were used by hedge fund pioneer Alfred 
Winslow Jones. They continue to be among the most popular of hedge fund strategies. 


7 Although a bank is taking some risks when it lends to a hedge fund, it is also true that a hedge fund 
is taking some risks when it chooses a prime broker. Many hedge funds that chose Lehman Brothers 
as their prime broker found that they could not access assets, which they had placed with Lehman 
Brothers as collateral, when the company went bankrupt in 2008. 
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The hedge fund manager identifies a set of stocks that are considered to be undervalued 
by the market and a set that are considered to be overvalued. The manager takes a long 
position in the first set and a short position in the second set. Typically, the hedge fund has 
to pay the prime broker a fee (perhaps 1% per year) to rent the shares that are borrowed 
for the short position. (See Chapter 5 for a discussion of short selling.) 

Long/short equity strategies are all about stock picking. If the overvalued and under- 
valued stocks have been picked well, the strategies should give good returns in both bull 
and bear markets. Hedge fund managers often concentrate on smaller stocks that are not 
well covered by analysts and research the stocks extensively using fundamental analysis, 
as pioneered by Benjamin Graham. The hedge fund manager may choose to maintain 
a net long bias where the shorts are of smaller magnitude than the longs or a net short 
bias where the reverse is true. Alfred Winslow Jones maintained a net long bias in his 
successful use of long/short equity strategies. 

An equity-market-neutral fund is one where longs and shorts are matched in some 
way. A dollar-neutral fund is an equity-market-neutral fund where the dollar amount of 
the long position equals the dollar amount of the short position. A beta-neutral fund is 
a more sophisticated equity-market-neutral fund where the weighted average beta of 
the shares in the long portfolio equals the weighted average beta of the shares in the 
short portfolio so that the overall beta of the portfolio is zero. If the capital asset pricing 
model is true, the beta-neutral fund should be totally insensitive to market movements. 
Long and short positions in index futures are sometimes used to maintain a beta-neutral 
position. 

Sometimes equity-market-neutral funds go one step further. They maintain sector 
neutrality where long and short positions are balanced by industry sectors or factor neutrality 
where the exposure to factors such as the price of oil, the level of interest rates, or the 
rate of inflation is neutralized. 


4.6.2 Dedicated Short 


Managers of dedicated short funds look exclusively for overvalued companies and sell 
them short. They are attempting to take advantage of the fact that brokers and analysts are 
reluctant to issue sell recommendations—even though one might reasonably expect the 
number of companies overvalued by the stock market to be approximately the same as 
the number of companies undervalued at any given time. Typically, the companies chosen 
are those with weak financials, those that change their auditors regularly, those that delay 
filing reports with the SEC, companies in industries with overcapacity, companies suing 
or attempting to silence their short sellers, and so on. 


4.6.3 Distressed Securities 


Bonds with credit ratings of BB or lower are known as “non-investment-grade” or “junk” 
bonds. Those with a credit rating of CCC are referred to as “distressed” and those with 
a credit rating of D are in default. Typically, distressed bonds sell at a big discount to their 
par value and provide a yield that is over 1,000 basis points (10%) more than the yield on 
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Treasury bonds. Of course, an investor only earns this yield if the required interest and 
principal payments are actually made. 

The managers of funds specializing in distressed securities carefully calculate a fair 
value for distressed securities by considering possible future scenarios and their prob- 
abilities. Distressed debt cannot usually be shorted and so they search for debt that is 
undervalued by the market. Bankruptcy proceedings usually lead to a reorganization or 
liquidation of a company. The fund managers understand the legal system, know priori- 
ties in the event of liquidation, estimate recovery rates, consider actions likely to be taken 
by management, and so on. 

Some funds are passive investors. They buy distressed debt when the price is below its 
fair value and wait. Other hedge funds adopt an active approach. They might purchase a 
sufficiently large position in outstanding debt claims so that they have the right to influ- 
ence a reorganization proposal. In Chapter 11 reorganizations in the United States, each 
class of claims must approve a reorganization proposal with a two-thirds majority. This 
means that one-third of an outstanding issue can be sufficient to stop a reorganization 
proposed by management or other stakeholders. In a reorganization of a company, the 
equity is often worthless and the outstanding debt is converted into new equity. Some- 
times, the goal of an active manager is to buy more than one-third of the debt, obtain 
control of a target company, and then find a way to extract wealth from it. 


4.6.4 Merger Arbitrage 


Merger arbitrage involves trading after a merger or acquisition is announced in the hope 
that the announced deal will take place. There are two main types of deals: cash deals and 
share-for-share exchanges. 

Consider first cash deals. Suppose that Company A announces that it is prepared to 
acquire all the shares of Company B for $30 per share. Suppose the shares of Company 
B were trading at $20 prior to the announcement. Immediately after the announcement 
its share price might jump to $28. It does not jump immediately to $30 because (a) there 
is some chance that the deal will not go through and (b) it may take some time for the 
full impact of the deal to be reflected in market prices. Merger-arbitrage hedge funds 
buy the shares in company B for $28 and wait. If the acquisition goes through at $30, 
the fund makes a profit of $2 per share. If it goes through at a higher price, the profit is 
higher. However, if for any reason the deal does not go through, the hedge fund will take 
a loss. 

Consider next a share-for-share exchange. Suppose that Company A announces that 
it is willing to exchange one of its shares for four of Company B’s shares. Assume that 
Company B’s shares were trading at 15% of the price of Company A’s shares prior to the 
announcement. After the announcement, Company B’s share price might rise to 22% of 
Company A’s share price. A merger-arbitrage hedge fund would buy a certain amount of 
Company B’s stock and at the same time short a quarter as much of Company A’ stock. 
This strategy generates a profit if the deal goes ahead at the announced share-for-share 
exchange ratio or one that is more favorable to Company B. 
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Merger-arbitrage hedge funds can generate steady, but not stellar, returns. It is impor- 
tant to distinguish merger arbitrage from the activities of Ivan Boesky and others who 
used inside information to trade before mergers became public knowledge® Trading on 
inside information is illegal. Ivan Boesky was sentenced to three years in prison and fined 
$100 million. 


4.6.5 Convertible Arbitrage 


Convertible bonds are bonds that can be converted into the equity of the bond issuer at 
certain specified future times with the number of shares received in exchange for a bond 
possibly depending on the time of the conversion. The issuer usually has the right to 
call the bond (i.e., buy it back for a prespecified price) in certain circumstances. Usually, 
the issuer announces its intention to call the bond as a way of forcing the holder to 
convert the bond into equity immediately. (If the bond is not called, the holder is likely 
to postpone the decision to convert it into equity for as long as possible.) 

A convertible arbitrage hedge fund has typically developed a sophisticated model 
for valuing convertible bonds. The convertible bond price depends in a complex way 
on the price of the underlying equity, its volatility, the level of interest rates, and the 
chance of the issuer defaulting. Many convertible bonds trade at prices below their fair 
value. Hedge fund managers buy the bond and then hedge their risks by shorting the 
stock. (This is an application of delta hedging, which will be discussed in Chapter 8.) 
Interest rate risk and credit risk can be hedged by shorting nonconvertible bonds that 
are issued by the company that issued the convertible bond. Alternatively, the managers 
can take positions in interest rate futures contracts, asset swaps, and credit default swaps 
to accomplish this hedging. 


4.6.6 Fixed Income Arbitrage 


The basic tool of fixed income trading is the zero-coupon yield curve, the construction 
of which is discussed in Appendix B. One strategy followed by hedge fund managers that 
engage in fixed-income arbitrage is a relative value strategy, where they buy bonds that 
the zero-coupon yield curve indicates are undervalued by the market and sell bonds that 
it indicates are overvalued. Market-neutral strategies are similar to relative value strategies 
except that the hedge fund manager tries to ensure that the fund has no exposure to 
interest rate movements. 

Some fixed-income hedge fund managers follow directional strategies where they 
take a position based on a belief that a certain spread between interest rates, or interest 
rates themselves, will move in a certain direction. Usually they have a lot of leverage and 
have to post collateral. They are therefore taking the risk that they are right in the long 
term, but that the market moves against them in the short term so that they cannot post 


8 The Michael Douglas character of Gordon Gekko in the award-winning movie Wall Street was based 
on Ivan Boesky. 
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collateral and are forced to close out their positions at a loss. This is what happened to 
Long-Term Capital Management (see Business Snapshot 22.1). 


4.6.7 Emerging Markets 


Emerging market hedge funds specialize in investments associated with developing coun- 
tries. Some of these funds focus on equity investments. They screen emerging market 
companies looking for shares that are overvalued or undervalued. They gather informa- 
tion by traveling, attending conferences, meeting with analysts, talking to management, 
and employing consultants. Usually they invest in securities trading on the local exchange, 
but sometimes they use American Depository Receipts (ADRs). ADRs are certificates 
issued in the United States and traded on a U.S. exchange. They are backed by shares of 
a foreign company. ADRs may have better liquidity and lower transaction costs than the 
underlying foreign shares. Sometimes there are price discrepancies between ADRs and 
the underlying shares giving rise to arbitrage opportunities. 

Another type of investment is debt issued by an emerging market country. Eurobonds 
are bonds issued by the country and denominated in a hard currency such as the U.S. 
dollar or the euro. Local currency bonds are bonds denominated in the local currency. 
Hedge funds invest in both types of bonds. They can be risky: countries such as Russia, 
Argentina, Brazil, and Venezuela have defaulted several times on their debt. 


4.6.8 Global Macro 


Global macro is the hedge fund strategy used by star managers such as George Soros 
and Julian Robertson. Global macro hedge fund managers carry out trades that reflect 
global macroeconomic trends. They look for situations where markets have, for whatever 
reason, moved away from equilibrium and place large bets that they will move back into 
equilibrium. Often the bets are on exchange rates and interest rates. A global macro 
strategy was used in 1992 when George Soros’s Quantum Fund gained $1 billion by 
betting that the British pound would decrease in value. More recently, hedge funds have 
(with mixed results) placed bets that the huge U.S. balance of payments deficit would 
cause the value of the U.S. dollar to decline. The main problem for global macro funds is 
that they do not know when equilibrium will be restored. World markets can for various 
reasons be in disequilibrium for long periods of time. 


4.6.9 Managed Futures 


Hedge fund managers that use managed futures strategies attempt to predict future move- 
ments in commodity prices. Some rely on the manager’s judgment; others use computer 
programs to generate trades. Some managers base their trading on technical analysis, 
which analyzes past price patterns to predict the future. Others use fundamental analysis, 
which involves calculating a fair value for the commodity from economic, political, and 
other relevant factors. 
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When technical analysis is used, trading rules are usually first tested on historical data. 
This is known as back-testing. If (as is often the case) a trading rule has come from an 
analysis of past data, trading rules should be tested out of sample (that is, on data that are 
different from the data used to generate the rules). Analysts should be aware of the perils 
of data mining. Suppose thousands of different trading rules are generated and then tested 
on historical data. Just by chance a few of the trading rules will perform very well—but 
this does not mean that they will perform well in the future. 


4.7 Hedge Fund Performance 


It is not as easy to assess hedge fund performance as it is to assess mutual fund performance. 
There is no data set that records the returns of all hedge funds. For the Lipper hedge 
funds database (TASS), which is available to researchers, participation by hedge funds 
is voluntary. Small hedge funds and those with poor track records often do not report 
their returns and are therefore not included in the data set. When returns are reported 
by a hedge fund, the database is usually backfilled with the fund’s previous returns. This 
creates a bias in the returns that are in the data set because, as just mentioned, the hedge 
funds that decide to start providing data are likely to be the ones doing well. When this 
bias is removed, some researchers have argued, hedge fund returns have historically been 
no better than mutual fund returns, particularly when fees are taken into account. 

Arguably, hedge funds can improve the risk-return trade-offs available to pension 
plans. This is because pension plans cannot (or choose not to) take short positions, obtain 
leverage, invest in derivatives, and engage in many of the complex trades that are favored 
by hedge funds. Investing in a hedge fund is a simple way in which a pension fund can 
(for a fee) expand the scope of its investing. This may improve its efficient frontier. (See 
Section 1.2 for a discussion of efficient frontiers.) 

It is not uncommon for hedge funds to report good returns for a few years and then 
“blow up.” Long-Term Capital Management reported returns (before fees) of 28%, 59%, 
57%, and 17% in 1994, 1995, 1996, and 1997, respectively. In 1998, it lost virtually all 
its capital. Some people have argued that hedge fund returns are like the returns from 
writing out-of-the-money options. Most of the time, the options cost nothing, but every 
so often they are very expensive. 

This may be unfair. Advocates of hedge funds would argue that hedge fund man- 
agers search for profitable opportunities that other investors do not have the resources 
or expertise to find. They would point out that the top hedge fund managers have been 
very successful at finding these opportunities. 

Prior to 2008, hedge funds performed quite well. In 2008, hedge funds on average 
lost money but provided a better performance than the S&P 500. During the years 2009 
to 2016, the S&P 500 provided a much better return than the average hedge fund? The 


*Tt should be pointed out that hedge funds often have a beta less than one (for example, long/short 
equity funds are often designed to have a beta close to zero), so a return less than the S&P 500 during 
periods when the market does very well does not necessarily indicate a negative alpha. 
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Table 4.5 Performance of Hedge Funds 


Barclays Hedge Fund S&P 500 Return 
Year Index Net Return (%) Including Dividends (%) 
2008 —21.63 —37.00 
2009 23.74 26.46 
2010 10.88 15.06 
2011 —5.48 2.11 
2012 8.25 16.00 
2013 11.12 32.39 
2014 2.88 13.39 
2015 0.04 1.38 
2016 6.10 11.96 


Barclays hedge fund index is the arithmetic average return (net after fees) of all hedge 
funds (excluding funds of funds) in the Barclays database. (Potentially, it has some of 
the biases mentioned earlier.) Table 4.5 compares returns given by the index with total 
returns from the S&P 500. 


Summary 


Mutual funds and ETFs offer a way small investors can capture the benefits of diversifica- 
tion. Overall, the evidence is that actively managed funds do not outperform the market 
and this has led many investors to choose funds that are designed to track a market index 
such as the S&P 500. 

Most mutual funds are open-end funds, so that the number of shares in the fund 
increases (decreases) as investors contribute (withdraw) funds. An open-end mutual fund 
calculates the net asset value of shares in the fund at 4 p.m. each business day and this is 
the price used for all buy and sell orders placed in the previous 24 hours. A closed-end 
fund has a fixed number of shares that trade in the same way as the shares of any other 
corporation. 

Exchange-traded funds (ETFs) are proving to be popular alternatives to open- and 
closed-end mutual funds. The shares held by the fund are known at any given time. 
Large institutional investors can exchange shares in the fund at any time for the assets 
underlying the shares, and vice versa. This ensures that the shares in the ETF (unlike 
shares in a closed-end fund) trade at a price very close to the fund’s net asset value. Shares 
in an ETF can be traded at any time (not just at 4 P.M.) and shares in an ETF (unlike 
shares in an open-end mutual fund) can be shorted. 

Hedge funds cater to the needs of large investors. Compared to mutual funds, they 
are subject to very few regulations and restrictions. Hedge funds charge investors much 
higher fees than mutual funds. The fee for a typical fund is “2 plus 20%.” This means 
that the fund charges a management fee of 2% per year and receives 20% of the profit 
after management fees have been paid generated by the fund if this is positive. Hedge 
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fund managers have a call option on the assets of the fund and, as a result, may have an 
incentive to take high risks. 

Among the strategies followed by hedge funds are long/short equity, dedicated 
short, distressed securities, merger arbitrage, convertible arbitrage, fixed-income arbitrage, 
emerging markets, global macro, and managed futures. The jury is still out on whether 
hedge funds provide better risk-return trade-offs than index funds after fees. There is an 
unfortunate tendency for hedge funds to provide excellent returns for a number of years 
and then report a disastrous loss. 
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Practice Questions and Problems 
(Answers at End of Book) 


4.1 What is the difference between an open-end and closed-end mutual fund? 

4.2 How is the NAV of an open-end mutual fund calculated? When is it calculated? 

4.3 An investor buys 100 shares in a mutual fund on January 1, 2018, for $30 each. 
The fund makes capital gains in 2018 and 2019 of $3 per share and $1 per share, 
respectively, and earns no dividends. The investor sells the shares in the fund during 
2020 for $32 per share. What capital gains or losses is the investor deemed to have 
made in 2018, 2019, and 2020? 

4.4 What is an index fund? How is it created? 

4.5 What is a mutual fund’s (a) front-end load and (b) back-end load? 

4.6 Explain how an exchange-traded fund that tracks an index works. What are the 
advantages of an exchange-traded fund over (a) an open-end mutual fund and (b) 
a closed-end mutual fund? 

4.7 What is the difference between the geometric mean and the arithmetic mean of 
a set of numbers? Why is the difference relevant to the reporting of mutual fund 
returns? 

4.8 Explain the meaning of (a) late trading, (b) market timing, (c) front running, and 
(d) directed brokerage. 

4.9 Give three examples of the rules that apply to mutual funds, but not to hedge funds. 

4.10 “If 70% of convertible bond trading is by hedge funds, I would expect the prof- 
itability of that strategy to decline.” Discuss this viewpoint. 
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4.13 


4.14 


4.15 


4.16 


4.17 


4.18 


Explain the meanings of the terms hurdle rate, high-water mark clause, and claw- 
back clause when used in connection with the incentive fees of hedge funds. 

A hedge fund charges 2 plus 20%. Investors want a return after fees of 20%. How 
much does the hedge fund have to earn, before fees, to provide investors with this 
return? Assume that the incentive fee is paid on the net return after management 
fees have been subtracted. 

“It is important for a hedge fund to be right in the long term. Short-term gains 
and losses do not matter.” Discuss this statement. 

“The risks that hedge funds take are regulated by their prime brokers.” Discuss this 
statement. 


Further Questions 


An investor buys 100 shares in a mutual fund on January 1, 2018, for $50 each. 
The fund earns dividends of $2 and $3 per share during 2018 and 2019. These are 
reinvested in the fund. The fund's realized capital gains in 2018 and 2019 are $5 per 
share and $3 per share, respectively. The investor sells the shares in the fund during 
2020 for $59 per share. Explain how the investor is taxed. 

Good years are followed by equally bad years for a mutual fund. It earns +8%, —8%, 
+12%, —12% in successive years. What is the investor’s overall return for the four 
years? 

A fund of funds divides its money between five hedge funds that earn —5%, 1%, 
10%, 15%, and 20% before fees in a particular year. The fund of funds charges 1 
plus 10% and the hedge funds charge 2 plus 20%. The hedge funds’ incentive fees 
are calculated on the return after management fees. The fund of funds incentive 
fee is calculated on the net (after management and incentive fees) average return 
of the hedge funds in which it invests and after its own management fee has been 
subtracted. What is the overall return on the investments? How is it divided between 
the fund of funds, the hedge funds, and investors in the fund of funds? 

A hedge fund charges 2 plus 20%. A pension fund invests in the hedge fund. Plot 
the return to the pension fund as a function of the return to the hedge fund. 


Chapter 5 


Trading in Financial Markets 


inancial institutions do a huge volume of trading in a wide range of different 
F financial instruments. There are a number of reasons for this. Some trades are 

designed to satisfy the needs of their clients, some are to manage their own risks, 
some are to exploit arbitrage opportunities, and some are to reflect their own views on 
the direction in which market prices will move. (The Volcker rule in the Dodd—Frank 
Act, which will be discussed in Chapter 16, prevents U.S. banks from trading for the last 
two reasons.) 

We will discuss the approaches a financial institution uses to manage its trading risks in 
later chapters. The purpose of this chapter is to set the scene by describing the instruments 
that they trade, how they trade, and the ways they are used. As different exchange-traded 
products are described, we will discuss the use of margin by exchanges. Margin is collateral 
(usually in the form of cash) that an exchange requires from traders to guarantee that they 
will not walk away from their obligations. We will see in later chapters that margin has 
now become an important feature of over-the-counter (OTC) markets as well. 


5.1 The Markets 


There are two markets for trading financial instruments. These are the exchange-traded 
market and the over-the-counter market (or OTC market). 


5.1.1 Exchange-Traded Markets 


Exchanges have been used to trade financial products for many years. Some exchanges 
such as the New York Stock Exchange (NYSE; www.nyse.com) focus on the trading of 
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stocks. Others such as the Chicago Board Options Exchange (CBOE; www.cboe.com) 
and CME Group (CME; www.cmegroup.com) are concerned with the trading of deriva- 
tives such as futures and options. 

The role of the exchange is to define the contracts that trade and organize trading 
so that market participants can be sure that the trades they agree to will be honored. In 
the past, individuals have met at the exchange and agreed on the prices for trades, often 
by using an elaborate system of hand signals. This is known as the open outcry system. 
Exchanges have now largely replaced the open outcry system with electronic trading. This 
involves traders entering their desired trades at a keyboard and a computer being used 
to match buyers and sellers. Not everyone agrees that the shift to electronic trading is 
desirable. Electronic trading is less physically demanding than traditional floor trading. 
However, in some ways, it is also less exciting. Traders do not have the opportunity to 
attempt to predict short-term market trends from the behavior and body language of 
other traders. 

Sometimes trading is facilitated with market makers. These are individuals or com- 
panies who are always prepared to quote both a bid price (price at which they are prepared 
to buy) and an offer price (price at which they are prepared to sell). For example, at the 
request of a trader, a market maker might quote “bid 30.30, offer 30.35” for a particular 
share, indicating a willingness to buy at $30.30 and sell at $30.35. At the time quotes are 
provided to the trader, the market maker does not know whether the trader wishes to 
buy or sell. Typically the exchange will specify an upper bound for the spread between a 
market maker’s bid and offer prices. The market maker earns its profit from this spread, 
but must manage its inventories carefully to limit its exposure to price changes. 


5.1.2 Over-the-Counter Markets 


The OTC market is a huge network of traders who work for financial institutions, large 
corporations, or fund managers. It is used for trading many different products including 
bonds, foreign currencies, and derivatives. Banks are very active participants in the 
market and often act as market makers for the more commonly traded instruments. For 
example, most banks are prepared to provide bid and offer quotes on a range of different 
exchange rates. 

A key advantage of the over-the-counter market is that the terms of a contract do not 
have to be those specified by an exchange. Market participants are free to negotiate any 
mutually attractive deal. Phone conversations in the over-the-counter market are usually 
taped. If there is a dispute over an agreement reached by phone, the tapes are replayed to 
resolve the issue. Trades in the over-the-counter market are typically much larger than 
trades in the exchange-traded market. 


5.2 Clearing Houses 


Exchange-traded derivatives contracts are administered by a clearing house. The clear- 
ing house has a number of members, and trades by non-members have to be channeled 
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through members for clearing. The members of the clearing house contribute to a guar- 
anty fund that is managed by the clearing house. 

Suppose that, in a particular exchange-traded market, Trader X agrees to sell one 
futures contract to Trader Y. The clearing house in effect stands between the two traders 
so that Trader X is selling the contract to the clearing house and Trader Y is buying the 
contract from the clearing house. The advantage of this is that Trader X does not need 
to worry about the creditworthiness of Trader Y, and vice versa. Both traders deal only 
with the clearing house. If a trader is a clearing house member, the trader deals directly 
with the clearing house. Otherwise, the trader deals with the clearing house through a 
clearing house member. 

When a trader has potential future liabilities from a trade (e.g., when the trader is 
entering into a futures contract or selling an option), the clearing house requires the 
trader to provide cash or marketable securities as collateral. The word used by clearing 
houses to describe collateral is margin. Without margin, the clearing house is taking the 
risk that the market will move against the trader and the trader will not fulfill his or her 
obligations. The clearing house aims to set margin requirements sufficiently high that it is 
over 99% certain that this will not happen. On those few occasions where it does happen, 
the guaranty fund is used. As a result, failures by clearing houses are extremely rare. 

Some OTC trades have been cleared through clearing houses, known as central 
counterparties (CCPs), for many years. A CCP plays a similar role to an exchange clear- 
ing house. It stands between the two sides in a transaction so that they do not have credit 
exposure to each other. Like an exchange clearing house, a CCP has members who 
contribute to a guaranty fund, and provide margin to guarantee their performance. Reg- 
ulations now require standardized derivatives between financial institutions to be cleared 
through CCPs. A fuller discussion of this is in Chapter 17. 


5.3 Long and Short Positions in Assets 


The simplest type of trade is the purchase of an asset for cash or the sale of an asset that 
is owned for cash. Examples of such trades are: 


e The purchase of 100 IBM shares 

e The sale of 1 million British pounds for U.S. dollars 

e The purchase of 1,000 ounces of gold 

e The sale of $1 million worth of bonds issued by General Motors 


The first of these trades would typically be done on an exchange; the other three would 
be done in the over-the-counter market. The trades are sometimes referred to as spot 
trades because they lead to almost immediate (on the spot) delivery of the asset. 

When purchasing an asset through a broker, a trader can often borrow up to half the 
cost of the asset from the broker, pledging the asset as collateral. This is known as buying 
on margin. The broker’s risk is that the price of the asset will decline sharply. Accordingly, 
the broker monitors the margin account balance. This is the part of the purchase price 
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paid by the investor adjusted for gains or losses on the asset. If the balance in the margin 
account falls below 25% of the value of the asset, the trader is required to bring the 
balance back up to that level. Suppose a trader buys 1,000 shares at $120 per share on 
margin. The trader will have to provide half the cost or $60,000. If the share price falls to 
$78, there is a $42,000 loss and the balance in the margin account becomes $18,000. This 
is 18,000/78,000 or 23.1% of the value of the asset, and the trader will have to provide 
$1,500 of cash to bring the balance in the margin account up to 25% of $78,000. If the 
trader fails to do this, the broker sells the position. 


5.3.1 Short Sales 


In some markets, it is possible to sell an asset that you do not own with the intention 
of buying it back later. This is referred to as shorting the asset. We will illustrate how it 
works by considering the shorting of shares of a stock. 

Suppose an investor instructs a broker to short 500 shares of a certain stock. The 
broker will carry out the instructions by borrowing the shares from another client and 
selling them on an exchange in the usual way. (A small fee may be charged for the 
borrowed shares.) The investor can maintain the short position for as long as desired, 
provided there are always shares available for the broker to borrow. At some stage, however, 
the investor will close out the position by purchasing 500 shares. These are then replaced 
in the account of the client from whom the shares were borrowed. The investor takes a 
profit if the stock price has declined and a loss if it has risen. If, at any time while the 
contract is open, the broker runs out of shares to borrow, the investor is short-squeezed 
and is forced to close out the position immediately, even if not ready to do so. 

An investor with a short position must pay to the broker any income, such as divi- 
dends or interest, that would normally be received on the asset that has been shorted. The 
broker will transfer this to the client account from which the asset was borrowed. Suppose 
a trader shorts 500 shares in April when the price per share is $120 and closes out the posi- 
tion by buying them back in July when the price per share is $100. Suppose further that 
a dividend of $1 per share is paid in May. The investor receives 500 x $120 = $60,000 
in April when the short position is initiated. The dividend leads to a payment by the 
investor of 500 x $1 = $500 in May. The investor also pays 500 x $100 = $50,000 for 
shares when the position is closed out in July. The net gain is, therefore, 


$60,000 — $500 — $50,000 = $9,500 


Table 5.1 illustrates this example and shows that (assuming no fee is charged for borrow- 
ing the shares) the cash flows from the short sale are the mirror image of the cash flows 
from purchasing the shares in April and selling them in July. 

An investor entering into a short position has potential future liabilities and is there- 
fore required to post margin. The margin is typically 150% of the value of the shares 
shorted. The proceeds of the sale typically form part of the margin account, so an extra 
50% of the value of the shares must be provided. 
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Table 5.1 Cash Flows from Short Sale and Purchase of Shares 


Purchase of Shares 


April: Purchase 500 shares for $120 —$60,000 
May: Receive dividend +$500 
July: Sell 500 shares for $100 per share +$50,000 


Net Profit = —$9,500 


Short Sale of Shares 


April: Borrow 500 shares and sell them for $120 +$60,000 
May: Pay dividend —$500 
July: Buy 500 shares for $100 per share —$50,000 


Replace borrowed shares to close short position 
Net Profit = +$9,500 


Suppose that 100 shares are shorted at $60. The margin required is 150% of the 
value of the shares or $9,000. The proceeds of the sale provide $6,000, so the trader must 
provide an extra $3,000. Suppose that a maintenance margin of 125% is set, so that when 
the balance in the margin account falls below 125% of the value of the shares there is a 
margin call requiring that the balance be brought up to this level. If the share price rises 
to $80, 125% of the value of the shares shorted is $10,000, and an extra $1,000 of margin 
would be required. The short trader must provide this to avoid having the short position 
closed out. 

From time to time, regulations are changed on short selling. The SEC abolished the 
uptick rule in the United States in July 2007 and reintroduced it in April 2009. (The 
effect of this rule is to allow shorting only when the most recent movement in the price 
of the stock is an increase.) On September 19, 2008, in an attempt to halt the slide in 
bank stock prices, the SEC imposed a temporary ban on the short selling of 799 financial 
companies. This was similar to a ban imposed by the UK Financial Services Authority 
(FSA) the day before. 


5.4 Derivatives Markets 


A derivative is an instrument whose value depends on (or derives from) other more basic 
market variables. A stock option, for example, is a derivative whose value is dependent 
on the price of a stock. 

Derivatives trade in both the exchange-traded and OTC markets. Both markets 
are huge. Although the statistics that are collected for the two markets are not exactly 
comparable, it is clear that the over-the-counter derivatives market is much larger 
than the exchange-traded derivatives market. The Bank for International Settlements 
(www.bis.org) started collecting statistics on the markets in 1998. Figure 5.1 compares 
(a) the estimated total principal amounts underlying transactions that were outstanding 
in the over-the-counter markets between June 1998 and December 2016 and (b) the 
estimated total value of the assets underlying exchange-traded contracts during the same 
period. Using these measures, in December 2016, the size of the over-the-counter market 
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Figure 5.1 Size of Over-the-Counter and Exchange-Traded Derivatives Markets between 1998 
and 2016 


was $482.9 trillion and the size of the exchange-traded market was $67.2 trillion! Figure 
5.1 shows that the OTC market grew rapidly until 2007, but has seen little net growth 
since then. The decline of the market in 2014 and 2015 was largely a result of compression. 
This is a procedure where two or more counterparties restructure transactions with each 
other with the result that the underlying principal is reduced. 

In interpreting these numbers, we should bear in mind that the principal (or value 
of assets) underlying a derivatives transaction is not the same as its value. An example of 
an over-the-counter contract is an agreement entered into some time ago to buy 100 
million U.S. dollars with British pounds at a predetermined exchange rate in one year. 
The total principal amount underlying this transaction is $100 million. However, the 
value of the contract might be only $1 million” The Bank for International Settlements 
estimated the gross market value of all OTC contracts outstanding in June 2017 to be 
about $12.7 trillion. 


5.5 Plain Vanilla Derivatives 


This section discusses the standard, or commonly traded, contracts in derivatives mar- 
kets: forwards, futures, swaps, and options. They are sometimes referred to as plain vanilla 
products. 


'When a contract is cleared through a CCP, two offsetting contracts are created (as will be described 
in Chapter 17). This inflates the OTC statistics. 

? A contract that is worth $1 million to one side and —$1 million to the other side would be counted 
as having a gross market value of $1 million. 
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Table 5.2 Spot and Forward Quotes for the USD/GBP Exchange 
Rate, June 9, 2017 (GBP = British pound; USD = USS. dollar; quote is 
number of USD per GBP) 


Bid Offer 
Spot 1.2732 1.2736 
1-month forward 1.2746 1.2751 
3-month forward 1.2772 1.2777 
1-year forward 1.2883 1.2889 


5.5.1 Forward Contracts 


A forward contract is an agreement to buy an asset in the future for a certain price. For- 
ward contracts trade in the over-the-counter market. One of the parties to a forward 
contract assumes a long position and agrees to buy the underlying asset on a certain spec- 
ified future date for a certain specified price. The other party assumes a short position and 
agrees to sell the asset on the same date for the same price. 

Forward contracts on foreign exchange are very popular. Table 5.2 provides quotes 
on the exchange rate between the British pound (GBP) and the U.S. dollar (USD) that 
might be provided by a large international bank on June 9, 2017 (the day after the UK 
election when Theresa May lost her parliamentary majority). The quotes are for the 
number of USD per GBP. The first row indicates that the bank is prepared to buy GBP 
(also known as sterling) in the spot market (i.e., for virtually immediate delivery) at the 
rate of $1.2732 per GBP and sell sterling in the spot market at $1.2736 per GBP; the 
second row indicates that the bank is prepared to buy sterling in one month at $1.2746 
per GBP and sell sterling in one month at $1.2751 per GBP; and so on. 

Forward contracts can be used to hedge foreign currency risk. Suppose that on June 9, 
2017, the treasurer of a U.S. corporation knows that the corporation will pay £1 million 
in one year (on June 9, 2018) and wants to hedge against exchange rate moves. The 
treasurer can agree to buy £1 million one-year forward at an exchange rate of 1.2889 
by trading with the bank providing the quotes in Table 5.2. The corporation then has a 
long forward contract on GBP It has agreed that on June 9, 2018, it will buy £1 million 
from the bank for $1.2889 million. The bank has a short forward contract on GBP. It 
has agreed that on June 9, 2018, it will sell £1 million for $1.2889 million. Both the 
corporation and the bank have made a binding commitment. 

What are the possible outcomes in the trade we have just described? The forward 
contract obligates the corporation to purchase 1 million pounds at an exchange rate 
of 1.2889 in one year. If the spot exchange rate applicable to the corporation when 
buying pounds rose to, say, 1.5000, at the end of one year the forward contract leads to 1 
million pounds being purchased by the corporation at an exchange rate of 1.2889 rather 
than at the spot exchange rate of 1.5000. This is worth $211,100 = (1.5000 — 1.2889) x 
$1,000,000) to the corporation. Of course, it may also happen that the contract will 
have a negative final value to the corporation. If the exchange rate falls to 1.1000 by 
the end of the year, the forward contract leads to 1 million pounds being purchased by 
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Figure 5.2 Payofts from Forward Contracts 
(a) Long position to buy 1 million British pounds in one year. (b) Short position to sell 1 million 
British pounds in one year. Forward price = 1.2889. 


the corporation at an exchange rate of 1.2889 rather than at the 1.1000 available in the 
market. This costs the corporation $188,900 (= (1.2889 — 1.1000) x $1,000,000). This 
example shows that a long forward contract can lead to a payoff that is a gain or loss. The 
payoff is the spot price of the assets underlying the forward contract minus the agreed 
delivery price for the assets and is shown in Figure 5.2(a). 

The bank in our example has entered into a short forward contract. Its position 
is the mirror image of that of the corporation. The bank has agreed to sell 1 million 
pounds for an exchange rate of 1.2889 in one year. If the spot exchange rate applica- 
ble to the bank rose to 1.5000, at the end of the year the forward contract leads to 1 
million pounds being sold by the bank at an exchange rate of 1.2889 rather than at 
the spot exchange rate of 1.5000. This costs the bank $211,100. If the exchange rate 
falls to 1.1000 by the end of the year, the forward contract leads to 1 million pounds 
being sold by the bank at an exchange rate of 1.2889 rather than 1.1000. This is worth 
$188,900 to the bank. The payoff is the agreed delivery price for the assets underly- 
ing the forward contract minus the spot price and is shown in Figure 5.2(b). The val- 
uation of forward contracts and the determination of forward prices are discussed in 
Appendix C. 


5.5.2 Futures Contracts 


Futures contracts, like forward contracts, are agreements to buy an asset at a future time. 
Unlike forward contracts, futures are traded on an exchange. This means that the contracts 
that trade are standardized. The exchange defines the amount of the asset underlying one 
contract, when delivery can be made, exactly what can be delivered, and so on. Contracts 
are referred to by their delivery month. For example, the September 2019 gold futures 
is a contract where delivery is made in September 2019. Whereas only one delivery day 
is usually specified for a forward contract, there is usually a period of time during the 
delivery month when delivery can take place in a futures contract. Alternative delivery 
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BUSINESS SNAPSHOT 5.1 
The Unanticipated Delivery of a Futures Contract 


This story (which may well be apocryphal) was told to the author of this book 
many years ago by a senior executive of a financial institution. It concerns a 
new employee of the financial institution who had not previously worked in the 
financial sector. One of the clients of the financial institution regularly entered 
into a long futures contract on live cattle for hedging purposes and issued instruc- 
tions to close out the position on the last day of trading. (Live cattle futures con- 
tracts trade on the Chicago Mercantile Exchange and each contract is on 40,000 
pounds of cattle.) The new employee was given responsibility for handling the 
account. 

When the time came to close out a contract, the employee noted that the 
client was long one contract and instructed a trader at the exchange go long (not 
short) one contract. The result of this mistake was that the financial institution 
ended up with a long position in two live cattle futures contracts. By the time 
the mistake was spotted, trading in the contract had ceased. 

The financial institution (not the client) was responsible for the mistake. As 
a result it started to look into the details of the delivery arrangements for live 
cattle futures contracts—something it had never done before. Under the terms 
of the contract, cattle could be delivered by the party with the short position to 
a number of different locations in the United States during the delivery month. 
Because it was long, the financial institution could do nothing but wait for a 
party with a short position to issue a notice of intention to deliver to the exchange 
and for the exchange to assign that notice to the financial institution. 

It eventually received a notice from the exchange and found that it would 
receive live cattle at a location 2,000 miles away the following Tuesday. The new 
employee was dispatched to the location to handle things. It turned out that the 
location had a cattle auction every Tuesday. The party with the short position that 
was making delivery bought cattle at the auction and then immediately delivered 
them. Unfortunately, the cattle could not be resold until the next cattle auction 
the following Tuesday. The employee was therefore faced with the problem of 
making arrangements for the cattle to be housed and fed for a week. This was a 


great start to a first job in the financial sector! 


times, delivery locations, and so on, are defined by the exchange. It is nearly always the 
party with the short position that has the right to initiate the delivery process and choose 
between the alternatives. As we will explain shortly, futures traders have potential future 
liabilities and are required to post margin. 
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Most futures contracts trade actively with the futures price at any given time being 
determined by supply and demand. If there are more buyers than sellers at a time when 
the September 2019 future price of gold is $1,280 per ounce, the price goes up. Similarly, 
if there are more sellers than buyers, the price goes down. 

One of the attractive features of exchange-traded contracts such as futures is that 
it is easy to close out a position. If you buy (i.e., take a long position in) a September 
2019 gold futures contract on March 5, 2019, you can exit on June 5, 2019, by selling 
(i.e., taking a short position in) the same contract. Closing out a position in a forward 
contract is not as easy as closing out a position in a futures contract. As a result, forward 
contracts usually lead to final delivery of the underlying assets, whereas futures contracts 
are usually closed out before the delivery month is reached. Business Snapshot 5.1 is an 
amusing story showing that the assets underlying futures contracts do get delivered if 
mistakes are made in the close out. 

The futures price of an asset is usually very similar to its forward price. Appendix 
C at the end of the book gives the relationship between the futures or forward price 
of an asset and its spot price. One difference between a futures and a forward contract 
is that a futures is settled daily whereas a forward is settled at the end of its life. For 
example, if a futures price increases during a day, money flows from traders with short 
positions to traders with long positions at the end of the day. Similarly, if a futures price 
decreases during a day, money flows in the opposite direction. Because a futures contract 
is settled daily whereas a forward contract is settled at the end of its life, the timing of the 
realization of gains and losses is different for the two contracts. This sometimes causes 
confusion, as indicated in Business Snapshot 5.2. Table 5.3 summarizes the difference 
between forward and futures contracts. 

Futures are cleared on an exchange clearing house. The clearing house stands 
between the two sides and is responsible for ensuring that required payments are made. 
The clearing house has a number of members. If a trader or broker is not a member of 
the exchange clearing house, it must arrange to clear trades through a member. 

The clearing house requires initial margin and variation margin from its members. 
The variation margin for a day may be positive or negative and covers the gains and 
losses during the day. The initial margin is an extra amount held by the exchange that 


Table 5.3 Comparison of Forward and Futures Contracts 


Forward Futures 

Private contract between two parties Traded on an exchange 

Not standardized Standardized contract 

Usually one specified delivery date Range of delivery dates 

Settled at end of contract Settled daily 

Delivery or final cash settlement usually takes Contract is usually closed out prior to 
place maturity 


Some credit risk Virtually no credit risk 
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BUSINESS SNAPSHOT 5.2 
A Software Error? 


A foreign exchange trader working for a bank enters into a long forward contract 
to buy 1 million pounds sterling at an exchange rate of 1.3000 in three months. 
At the same time, another trader on the next desk takes a long position in 16 
three-month futures contracts on sterling. The futures price is 1.3000 and each 
contract is on 62,500 pounds. The sizes of the positions taken by the forward 
and futures traders are therefore the same. Within minutes of the positions being 
taken, the forward and the futures prices both increase to 1.3040. The bank’s 
systems show that the futures trader has made a profit of $4,000 while the forward 
trader has made a profit of only $3,900. The forward trader immediately calls 
the bank’s systems department to complain. Does the forward trader have a valid 
complaint? 

The answer is no! The daily settlement of futures contracts ensures that the 
futures trader realizes an almost immediate profit corresponding to the increase 
in the futures price. If the forward trader closed out the position by entering into 
a short contract at 1.3040, the forward trader would have contracted to buy 1 
million pounds at 1.3000 in three months and sell 1 million pounds at 1.3040 
in three months. This would lead to a $4,000 profit—but in three months, not 
in one day. The forward trader’s profit is the present value of $4,000. 

The forward trader can gain some consolation from the fact that gains and 
losses are treated symmetrically. If the forward/futures prices dropped to 1.2960 
instead of rising to 1.3040, the futures trader would take a loss of $4,000 while 
the forward trader would take a loss of only $3,900. Also, over the three-month 
contract life, the total gain or loss from the futures contract and the forward 
contract would be the same. 


provides protection to the exchange in the event that the member defaults. In addition, 
the exchange clearing house requires default fund (also known as a guaranty fund) 
contributions from its members. This provides additional protection to the exchange. If 
a member defaults and its initial margin and default fund contributions are not enough 
to cover losses, the default fund contributions of other members can be used. 
Exchange clearing house members require margin from brokers and other traders 
when they agree to clear their trades and brokers require margin from their clients. 
Typically the relationship between broker and client involves the posting of initial margin 
(greater than the initial margin applicable to a member). When the balance in the client’s 
margin account (adjusted for daily gains and losses) falls below a maintenance margin 
level, the client is required to bring the balance back up to the initial margin level. 
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Figure 5.3 A Plain Vanilla Interest Rate Swap 


5.5.3 Swaps 


The first swap contracts were negotiated in the early 1980s. Since then, the market has 
seen phenomenal growth. Swaps now occupy a position of central importance in the 
over-the-counter derivatives market. 

A swap is an agreement between two companies to exchange cash flows in the future. 
The agreement defines the dates when the cash flows are to be paid and the way in 
which they are to be calculated. Usually the calculation of the cash flows involves the 
future values of interest rates, exchange rates, or other market variables. 

A forward contract can be viewed as a simple example of a swap. Suppose it is March 
1, 2019, and a company enters into a forward contract to buy 100 ounces of gold for 
$1,300 per ounce in one year. The company can sell the gold in one year as soon as it 
is received. The forward contract is therefore equivalent to a swap where the company 
agrees that on March 1, 2020, it will swap 100 times the spot price of gold for $130,000. 

Whereas a forward contract is equivalent to the exchange of cash flows on just one 
future date, swaps typically lead to cash flow exchanges taking place on several future 
dates. The most common swap is a “plain vanilla” interest rate swap where a fixed rate 
of interest is exchanged for IBOR? Both interest rates are applied to the same notional 
principal. (The principal is “notional” because it is used only for the determination of 
interest exchanges. The principal itself is not exchanged.) A swap where Company A 
pays a fixed rate of interest of 3% and receives LIBOR is shown in Figure 5.3. (Note that 
all rates in this example are semiannually compounded.) Suppose that in this contract 
interest rates are reset every six months, the notional principal is $100 million, and the 
swap lasts for three years. Table 5.4 shows the cash flows to Company A if six-month 
LIBOR interest rates prove to be those shown in the second column of the table. The 
swap is entered into on March 3, 2019. The six-month interest rate on that date is 2.2% 
per year or 1.1% per six months. As a result, the floating-rate cash flow received six 
months later, on September 3, 2019, is 0.011 X 100 or $1.1 million. Similarly, the six- 
month interest rate of 2.8% per annum (or 1.4% per six months) on September 3, 2019, 
leads to the floating cash flow received six months later (on March 3, 2020) being $1.4 
million; and so on. The fixed-rate cash flow paid is always $1.5 million (3% of $100 
million applied to a six-month period). Note that the timing of cash flows corresponds 


3LIBOR is the London Interbank Offered Rate. It is the rate at which a AA-rated bank is able to 
borrow from another bank and is discussed in Chapter 9. 
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Table 5.4 Example of Cash Flows ($ millions) to Company A in Swap in Figure 
5.3. Swap lasts three years and has a principal of $100 million. 


6-Month LIBOR Floating Cash Fixed Cash Net 

Date Rate (% per annum) Flow Received Flow Paid Cash Flow 
Mar. 3, 2019 2.20 

Sep. 3, 2019 2.80 +1.10 —1.50 —0.40 
Mar. 3, 2020 3.30 +1.40 —1.50 —0.10 
Sep. 3, 2020 3.50 +1.65 —1.50 +0.15 
Mar. 3, 2021 3.60 +1.75 —1.50 +0.25 
Sep. 3, 2021 3.90 +1.80 —1.50 +0.30 
Mar. 3, 2022 +1.95 —1.50 +0.45 


Cash flows do not take account of day count conventions, holiday calendars, and so on. Interest rates are 
semiannually compounded. 


to the usual way interest rates such as LIBOR work. The interest is observed at the 
beginning of the period to which it applies and paid at the end of the period. 

Plain vanilla interest rate swaps are very popular because they can be used for many 
purposes. For example, the swap in Figure 5.3 could be used by Company A to transform 
borrowings at a floating rate of LIBOR plus 1% to borrowings at a fixed rate of 4%. The 
combination of 


1. Pay interest at LIBOR plus 1% under loan agreement; 
2. Receive LIBOR under swap agreement; and 
3. Pay 3% under the swap agreement 


nets out to a fixed payment of 4%. It can also be used by Company A to transform an 
investment earning a fixed rate of 2.5% to an investment earning LIBOR minus 0.5%. 
The combination of 


1. Receive 2.5% on the investment; 
2. Receive LIBOR under swap agreement; and 
3. Pay 3% under the swap agreement 


nets out to a floating income at the rate of LIBOR minus 0.5%. 


Example 5.1 


Suppose a bank has floating-rate deposits and five-year fixed-rate loans. As will be dis- 
cussed in Chapter 9, this exposes the bank to significant risks. If rates rise, the deposits 
will be rolled over at high rates and the bank’s net interest income will contract. The 
bank can hedge its risks by entering into the interest rate swap in Figure 5.3 (taking the 
role of Company A). The swap can be viewed as transforming the floating-rate deposits 
to fixed-rate deposits. Alternatively, it can be viewed as transforming the fixed-rate loans 
to floating-rate loans. 
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Table 5.5 Swap Quotes Made by a Market Maker (percent per annum) 


Maturity (years) Bid Offer Swap Rate 
2 2.55 2.58 2.565 
3 2.97 3.00 2.985 
4 3.15 3.19 3.170 
5 3.26 3.30 3.280 
7 3.40 3.44 3.420 
10 3.48 3.52 3.500 


Table 5.5 shows quotes for U.S. dollar swaps that might be posted by the derivatives 
dealer in Figure 5.3.4 The first row shows that the bank is prepared to enter into a two- 
year swap where it pays a fixed rate of 2.55% and receives LIBOR. It is also prepared 
to enter into a swap where it receives 2.58% and pays LIBOR. The bid—offer spread in 
Table 5.5 is 3 or 4 basis points. The average of the bid and offered fixed rates is known 
as the swap rate. This is shown in the final column of the table. 

The valuation of plain vanilla interest rate swaps is discussed in Appendix D at the 


end of this book. 


5.5.4 Options 


Options are traded both on exchanges and in the over-the-counter market. There are 
two basic types of option. A call option gives the holder the right to buy the underlying 
asset by a certain date for a certain price. A put option gives the holder the right to sell 
the underlying asset by a certain date for a certain price. The price in the contract is 
known as the exercise price or strike price; the date in the contract is known as the expiration 
date or maturity date. American options can be exercised at any time up to the expiration 
date. European options can be exercised only on the expiration date itself. Most of the 
options that are traded on exchanges are American. In the exchange-traded equity option 
market, one contract is usually an agreement to buy or sell 100 shares. European options 
are generally easier to analyze than American options, and some of the properties of an 
American option are frequently deduced from those of its European counterpart. 

An at-the-money option is an option where the strike price equals the price of the 
underlying asset An out-of-the-money option is a call option where the strike price is 
above the price of the underlying asset or a put option where the strike price is below 


‘The standard swap in the United States is one where fixed payments made every six months are 
exchanged for floating LIBOR payments made every three months. In Table 5.4, we assumed that 
fixed and floating payments are exchanged every six months. 

5 Note that the terms American and European do not refer to the location of the option or the exchange. 
A few of the options trading on North American exchanges are European. 

© Other definitions of “‘at-the-money” are on occasion used. For example, an at-the money option is 
sometimes defined as one where the present value of the strike price equals the asset price (the present 
value being determined by discounting the strike price from the end of the life of the option to the 
present). An at-the-money option is also sometimes defined as a call option with a delta of 0.5 ora 
put option with a delta of —0.5. (See Section 8.1 for a definition of delta.) 
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Table 5.6 Prices of Options on Intel, June 12, 2017 (stock price = $35.91) 


Strike Calls Puts 

Price ($) Aug. 17 Oct. 17 Dec. 17 Aug. 17 Oct. 17 Dec. 17 
34 2.48 2.88 3.15 0.63 1.07 1.42 
35 1.77 2.23 2.53 0.95 1.41 1.80 
36 1.18 1.66 1.98 1.39 1.85 2.25 
37 0.74 1.20 1.22 1.98 2.40 2.79 


this price. An in-the-money option is a call option where the strike price is below the price 
of the underlying asset or a put option where the strike price is above this price. 

It should be emphasized that an option gives the holder the right to do something. 
The holder does not have to exercise this right. By contrast, in a forward and futures 
contract, the holder is obligated to buy or sell the underlying asset. Note that whereas it 
costs nothing to enter into a forward or futures contract, there is a cost to acquiring an 
option. This cost is referred to as the option premium. 

The largest exchange in the world for trading stock options is the Chicago Board 
Options Exchange (CBOE; www.cboe.com). Table 5.6 gives the most recent trading 
prices ofsome of the American options trading on Intel (ticker INTC) at a particular time 
on June 12, 2017. The option strike prices are $34, $35, $36, and $37. The maturities are 
August 2017, October 2017, and December 2017. The August options have an expiration 
date of August 18, 2017; the October options have an expiration date of October 20, 
2017; and the December options have an expiration date of December 15, 2017.’ Intel’s 
stock price was $35.91 

Suppose an investor instructs a broker to buy one October call option contract on 
Intel with a strike price of $36. The broker will relay these instructions to a trader at 
the CBOE. This trader will then find another trader who wants to sell one October call 
contract on Intel with a strike price of $36, and a price will be agreed upon. We assume 
that the price is $1.66, as indicated in Table 5.6. This is the price for an option to buy 
one share. In the United States, one stock option contract is a contract to buy or sell 
100 shares. Therefore, the investor must arrange for $166 to be remitted to the exchange 
through the broker. The exchange will then arrange for this amount to be passed on to 
the party on the other side of the transaction. 

In our example, the investor has obtained at a cost of $166 the right to buy 100 Intel 
shares for $36 each. The party on the other side of the transaction has received $120 and 
has agreed to sell 100 Intel shares for $36 per share if the investor chooses to exercise the 
option. If the price of Intel does not rise above $36 before October 20, 2017, the option 
is not exercised and the investor loses $166. But if the Intel share price does well and the 
option is exercised when it is $50, the investor is able to buy 100 shares at $36 per share 
when they are worth $50 per share. This leads to a gain of $1,400, or $1,234 when the 
initial cost of the options is taken into account. 


7 The exchange chooses the expiration date as the third Friday of the delivery month. 
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Figure 5.4 Net Profit from (a) Buying a Contract Consisting of 100 Intel October Call Options 
with a Strike Price of $36 and (b) Buying a Contract Consisting of 100 Intel December Put Options 
with a Strike Price of $36 


If the investor is bearish on Intel, an alternative trade would be the purchase of one 
December put option contract with a strike price of $36. From Table 5.6, we see that this 
would cost 100 X $2.25 or $225. The investor would obtain at a cost of $225 the right 
to sell 100 Intel shares for $36 per share prior to December 15, 2017. If the Intel share 
price moves above $36 and stays above $36, the option is not exercised and the investor 
loses $225. But if the investor exercises when the stock price is $25, the investor makes a 
gain of $1,100 by buying 100 Intel shares at $25 and selling them for $36. The net profit 
after the cost of the options is taken into account is $875. 

The options trading on the CBOE are American. If we assume for simplicity that 
they are European so that they can be exercised only at maturity, the investor’s profit as a 
function of the final stock price for the Intel options we have been considering is shown 
in Figure 5.4. 

There are four types of trades in options markets: 


1. Buying a call 
2. Selling a call 
3. Buying a put 
4. Selling a put 


Buyers are referred to as having long positions; sellers are referred to as having short positions. 
Selling an option is also known as writing an option. 

When a trader purchases options for cash, there are no margin requirements because 
there are no circumstances under which the trade becomes a liability for the trader in 
the future. Options on stocks and stock indices that last longer than nine months can be 
bought on margin in the United States. The initial and maintenance margin is 75% of 
the value of the option. 

When options are sold (i.e., written), there are potential future liabilities and mar- 
gin must be posted. When a call option on a stock has been written, the initial and 
maintenance margin in the United States is the greater of 


1. 100% of the value of the option plus 20% of the underlying share price less the 
amount, if any, by which the option is out-of-the-money. 
2. 100% of the value of the option plus 10% of the share price. 
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When a put option is written, it is the greater of 


1. 100% of the value of the option plus 20% of the underlying share price less the 
amount, if any, by which the option is out-of-the-money. 
2. 100% of the value of the option plus 10% of the exercise price. 


These margin requirements may be reduced if the trader has other positions in the stock. 
For example, if the trader has a fully covered position (where the trader has sold call 
options on a certain number of shares and owns the same number of shares), there is no 
margin requirement on the short option position. 

Options trade very actively in the over-the-counter market as well as on exchanges. 
The underlying assets include stocks, currencies, and stock indices. Indeed, the over-the- 
counter market for options is now larger than the exchange-traded market. Whereas 
exchange-traded options tend to be American, options trading in the over-the-counter 
market are frequently European. The advantage of the over-the-counter market is that 
maturity dates, strike prices, and contract sizes can be tailored to meet the precise needs 
of a client. They do not have to correspond to those specified by the exchange. The 
sizes of option trades in the over-the-counter are usually much greater than those on 
exchanges. 

Valuation formulas and numerical procedures for options on a variety of assets are in 
Appendices E and F at the end of this book. 


5.5.5 Interest Rate Options 


Important interest rate options that trade in the over-the-counter market are caps, floors, 
and swap options (also known as swaptions). Where a swap exchanges a sequence of floating 
rates for a fixed rate, as indicated in Table 5.4, a cap, as its name implies, caps the floating 
rate. It is a series of call options on a floating rate (usually LIBOR). If the floating rate is 
greater than the strike rate (also known as cap rate), there is a payoff equal to the excess 
of the floating rate over the cap rate, applied to a predetermined notional principal; if the 
floating rate is less than the cap rate, there is no payoff. As in the case of swaps, payofts 
are made at the end of the period covered by an interest rate. 

There is usually no payoff for the first period covered, because the rate for that period 
is known when the contract is entered into. Consider a trader who, on March 3, 2019, 
buys a three-year cap on six-month LIBOR with a cap rate of 3.2% and a principal of 
$100 million. If rates proved to be those indicated in the second column of Table 5.4, 
there would be no payoff on March 3, 2020. The payoff on September 3, 2020, would 
be 0.5 x (0.0330 — 0.0320) x 100 or $0.05 million. Similarly, there would be payoffs of 
$0.15 million, $0.20 million, and $0.35 million on March 3, 2021, September 3, 2021, 
and March 3, 2022, respectively. 

A floor is similarly a series of put options on floating rates. If the instrument we have 
just been considering were a floor rather than a cap, the payoff would be 0.5 Xx (0.0320 — 
0.0280) x 100 or $0.20 million on March 3, 2020, and zero on the other dates. 
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A swap option is an option to enter into a swap at some future time where the fixed 
rate is the strike rate. There are two types of swap options. One is the option to pay the 
strike rate and receive LIBOR;; the other is the option to pay LIBOR and receive the 
strike rate. As in the case of a swap and cap or floor, a notional principal is specified. 


5.6 Non-Traditional Derivatives 


Whenever there are risks in the economy, financial engineers have attempted to devise 
derivatives to allow entities with exposures to manage their risks. Financial institutions 
typically act as intermediaries and arrange for the risks to be passed on to either (a) 
entities that have opposite exposures or (b) speculators who are willing to assume the 
risks. This section gives examples of derivatives that have been developed to meet specific 
needs. 


5.6.1 Weather Derivatives 


Many companies are in the position where their performance is liable to be adversely 
affected by the weathers It makes sense for these companies to consider hedging their 
weather risk in much the same way as they hedge foreign exchange or interest rate risks. 

The first over-the-counter weather derivatives were introduced in 1997. To under- 
stand how they work, we explain two variables: 


HDD: Heating degree days 
CDD: Cooling degree days 


A day’s HDD is defined as 

HDD = max(0, 65 — A) 
and a day’s CDD is defined as 

CDD = max(0, A — 65) 


where A is the average of the highest and lowest temperature during the day at a spec- 
ified weather station, measured in degrees Fahrenheit. For example, if the maximum 
temperature during a day (midnight to midnight) is 68° Fahrenheit and the minimum 
temperature is 44° Fahrenheit, A = 56. The daily HDD is then 9 and the daily CDD 
is 0. 

A typical over-the-counter product is a forward or option contract providing a pay- 
off dependent on the cumulative HDD or CDD during a month (that is, the total of 


8 The U.S. Department of Energy has estimated that one-seventh of the U.S. economy is subject to 
weather risk. 
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the HDDs or CDDs for every day in the month). For example, a dealer could, in Jan- 
uary 2018, sell a client a call option on the cumulative HDD during February 2019 
at the Chicago O’Hare Airport weather station, with a strike price of 700 and a pay- 
ment rate of $10,000 per degree day. If the actual cumulative HDD is 820, the payoff is 
$10, 000 x (820 — 700) = $1.2 million. Often contracts include a payment cap. If the cap 
in our example is $1.5 million, the client’s position is equivalent to a long call option on 
cumulative HDD with a strike price of 700 and a short call option on cumulative HDD 
with a strike price of 850. 

A day’s HDD is a measure of the volume of energy required for heating during 
the day. A days CDD is a measure of the volume of energy required for cooling dur- 
ing the day. Most weather derivatives contracts are entered into by energy producers 
and energy consumers. But retailers, supermarket chains, food and drink manufacturers, 
health service companies, agriculture companies, and companies in the leisure industry 
are also potential users of weather derivatives. The Weather Risk Management Associ- 
ation (www.wrma.org) has been formed to serve the interests of the weather risk man- 
agement industry. 

In September 1999, the Chicago Mercantile Exchange began trading weather futures 
and European options on weather futures. The contracts are on the cumulative HDD 
and CDD for a month observed at a weather station. The contracts are settled in cash just 
after the end of the month once the HDD and CDD are known. One futures contract is 
$20 times the cumulative HDD or CDD for the month. The CME now offers weather 
futures and options for many cities throughout the world. It also offers futures and options 
on hurricanes, frost, and snowfall. 


5.6.2 Oil Derivatives 


Crude oil is one of the most important commodities in the world. Global demand is 
estimated by the United States Energy Information Administration (www.ela.gov) to 
be about 90 million barrels per day. Ten-year fixed-price supply contracts have been 
commonplace in the over-the-counter market for many years. These are swaps where 
oil at a fixed price is exchanged for oil at a floating price. 

There are many grades of crude oil, reflecting variations in gravity and sulfur content. 
Two important benchmarks for pricing are Brent crude oil (which is sourced from the 
North Sea) and West Texas Intermediate (WTI) crude oil. Crude oil is refined into 
products such as gasoline, heating oil, fuel oil, and kerosene. 

In the over-the-counter market, virtually any derivative that is available on common 
stocks or stock indices is now available with oil as the underlying asset. Swaps, forward 
contracts, and options are popular. Contracts sometimes require settlement in cash and 
sometimes require settlement by physical delivery (i.e., by delivery of the oil). 

Exchange-traded contracts on oil are also popular. The CME Group and the Inter- 
continental Exchange (ICE) trade a number of oil futures and futures options contracts. 
Some of the futures contracts are settled in cash; others are settled by physical delivery. 
For example, the Brent crude oil futures traded on ICE have a cash settlement option; 
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the WTI oil futures traded by the CME Group require physical delivery. In both cases, 
the amount of oil underlying one contract is 1,000 barrels. The CME Group also trades 
popular contracts on two refined products: heating oil and gasoline. In both cases, one 
contract is for the delivery of 42,000 gallons. 


5.6.3 Natural Gas Derivatives 


The natural gas industry throughout the world went through a period of deregulation 
and the elimination of government monopolies in the 1980s and 1990s. The supplier of 
natural gas is now not necessarily the same company as the producer of the gas. Suppliers 
are faced with the problem of meeting daily demand. 

A typical over-the-counter contract is for the delivery of a specified amount of nat- 
ural gas at a roughly uniform rate over a one-month period. Forward contracts, options, 
and swaps are available in the over-the-counter market. The seller of gas is usually respon- 
sible for moving the gas through pipelines to the specified location. 

The CME Group trades a contract for the delivery of 10,000 million British thermal 
units of natural gas. The contract, if not closed out, requires physical delivery to be made 
during the delivery month at a roughly uniform rate to a particular hub in Louisiana. 
ICE trades a similar contract in London. 

Natural gas is a popular source of energy for heating buildings. It is also used to 
produce electricity, which in turn is used for air-conditioning. As a result, demand for 
natural gas is seasonal and dependent on the weather. 


5.6.4 Electricity Derivatives 


Electricity is an unusual commodity because it cannot easily be stored? The maximum 
supply of electricity in a region at any moment is determined by the maximum capacity 
of all the electricity-producing plants in the region. In the United States there are 140 
regions known as control areas. Demand and supply are first matched within a control area, 
and any excess power is sold to other control areas. It is this excess power that constitutes 
the wholesale market for electricity. The ability of one control area to sell power to 
another control area depends on the transmission capacity of the lines between the two 
areas. Transmission from one area to another involves a transmission cost, charged by the 
owner of the line, and there are generally some energy transmission losses. 

A major use of electricity is for air-conditioning systems. As a result, the demand 
for electricity, and therefore its price, is much greater in the summer months than in the 
winter months. The nonstorability of electricity causes occasional very large movements 
in the spot price. Heat waves have been known to increase the spot price by as much as 
1,000% for short periods of time. 


? Electricity producers with spare capacity sometimes use it to pump water to the top of their hydro- 
electric plants so that it can be used to produce electricity at a later time. This is the closest they can 
get to storing this commodity. 
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Like natural gas, electricity has been going through a period of deregulation and the 
elimination of government monopolies. This has been accompanied by the development 
of an electricity derivatives market. The CME Group now trades a futures contract on the 
price of electricity, and there is an active over-the-counter market in forward contracts, 
options, and swaps. A typical contract (exchange-traded or over-the-counter) allows one 
side to receive a specified number of megawatt hours for a specified price at a specified 
location during a particular month. In a 5 X 8 contract, power is received for five days a 
week (Monday to Friday) during the off-peak period (11 P.M. to 7 A.M) for the specified 
month.Ina5 X 16 contract, power is received five days a week during the on-peak period 
(7 A.M. to 11 p.m.) for the specified month. In a 7 X 24 contract, power is received around 
the clock every day during the month. Option contracts have either daily exercise or 
monthly exercise. In the case of daily exercise, the option holder can choose on each day 
of the month (by giving one day’s notice) to receive the specified amount of power at the 
specified strike price. When there is monthly exercise, a single decision on whether to 
receive power for the whole month at the specified strike price is made at the beginning 
of the month. 

An interesting contract in electricity and natural gas markets is what is known as a 
swing option or take-and-pay option. In this contract, a minimum and maximum for the 
amount of power that must be purchased at a certain price by the option holder is 
specified for each day during a month and for the month in total. The option holder 
can change (or swing) the rate at which the power is purchased during the month, but 
usually there is a limit on the total number of changes that can be made. 


5.7 Exotic Options and Structured Products 


Many different types of exotic options and structured products trade in the over-the- 
counter market. Although the amount of trading in them is small when compared with 
the trading in the plain vanilla derivatives discussed in Section 5.5, they are important to 
a bank because the profit on trades in exotic options and structured products tends to be 
much higher than on plain vanilla options or swaps. 

Here are a few examples of exotic options: 


Asian options: Whereas regular options provide a payoff based on the price of the underly- 
ing asset at the time of exercise, Asian options provide a payoff based on the average of 
the price of the underlying asset over some specified period. An example is an average 
price call option that provides a payoff in one year equal to max(S — K,0) where S is 
the average asset price during the year and K is the strike price. 

Barrier options: These are options that come into existence or disappear when the price 
of the underlying asset reaches a certain barrier. For example, a knock-out call option 
with a strike price of $30 and a barrier of $20 is a regular European call option that 
ceases to exist if the asset price falls below $20. 
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Basket options: These are options to buy or sell a portfolio of assets rather than options 
on a single asset. 

Binary options: These are options that provide a fixed dollar payoff, or a certain amount 
of the underlying asset, if some condition is satisfied. An example is an option that 
provides a payoff in one year of $1,000 if a stock price is greater than $20. 

Compound options: These are options on options. There are four types: a call on a call, a 
call on a put, a put on a call, and a put on a put. An example of a compound option 
is a European option to buy a European option on a stock currently worth $15. The 
first option expires in one year and has a strike price of $1. The second option expires 
in three years and has a strike price of $20. 

Lookback options: These are options that provide a payoff based on the maximum or 
minimum price of the underlying asset over some period. An example is an option 


that provides a payoff in one year equal to Sy — Smin where Sy is the asset price at the 


min 


end of the year and Smin is the minimum asset price during the year. 


min 


Exotic options are sometimes more appropriate for hedging than plain vanilla 
options. As explained in Business Snapshot 5.3, Microsoft has used Asian options on 
a basket of some of its foreign currency hedging. 

Structured products are products created by banks to meet the needs of investors or 
corporate treasurers. One example of a structured product is a principal protected note, 
where a bank offers an investor the opportunity to earn a certain percentage of the return 
provided by the S&P 500 with a guarantee that the return will not be negative. Another 
example of a (highly) structured product is the 5/30 transaction described in Business 
Snapshot 5.4.'° (In the case of this product, it is debatable whether Bankers Trust was 
meeting a client need or selling the client a product it did not need!) 


5.8 Risk Management Challenges 


Instruments such as futures, forwards, swaps, options, and structured products are versa- 
tile. They can be used for hedging, for speculation, and for arbitrage. (Hedging involves 
reducing risks; speculation involves taking risks; and arbitrage involves attempting to lock 
in a profit by simultaneously trading in two or more markets.) It is this very versatility 
that can cause problems. Sometimes traders who have a mandate to hedge risks or fol- 
low an arbitrage strategy become (consciously or unconsciously) speculators. The results 
can be disastrous. One example of this is provided by the activities of Jérôme Kerviel at 
Société Générale (see Business Snapshot 5.5). 

To avoid the type of problems Société Générale encountered is an important risk 
management challenge. Both financial and nonfinancial corporations must set up controls 
to ensure that derivatives are being used for their intended purpose. Risk limits should 


10 The details of this transaction are in the public domain because it later became the subject of 
litigation. See D. J. Smith, “Aggressive Corporate Finance: A Close Look at the Procter & Gamble— 
Bankers Trust Leveraged Swap,” Journal of Derivatives 4, no. 4 (Summer 1997): 67-79. 
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BUSINESS SNAPSHOT 5.3 
Microsoft’s Hedging 


Microsoft actively manages its foreign exchange exposure. In some countries 
(e.g., Europe, Japan, and Australia), it bills in the local currency and converts its 
net revenue to U.S. dollars monthly. For these currencies, there is a clear exposure 
to exchange rate movements. In other countries (e.g., those in Latin America, 
Eastern Europe, and Southeast Asia), it bills in U.S. dollars. The latter appears to 
avoid any foreign exchange exposure—but it does not. 

Suppose the U.S. dollar strengthens against the currency of a country in 
which Microsoft is billing in U.S. dollars. People in the country will find 
Microsoft’s products more expensive because it takes more of the local currency 
to buy $1. As a result, Microsoft is likely to find it necessary to reduce its (US. 
dollar) price in the country or face a decline in sales. Microsoft therefore has a 
foreign exchange exposure—both when it bills in U.S. dollars and when it bills 
in the local currency. (This shows that it is important for a company to consider 
the big picture when assessing its exposure.) 

Microsoft sometimes uses options for hedging. Suppose it chooses a one-year 
time horizon. Microsoft recognizes that its exposure to an exchange rate (say, 
the Japanese yen—U.S. dollar exchange rate) is an exposure to the average of the 
exchange rates at the end of each month during the year. This is because approx- 
imately the same amount of Japanese yen is converted to U.S. dollars each month. 
Asian options, rather than regular options, are appropriate to hedge its exposure. 
What is more, Microsoft’s total foreign exchange exposure is a weighted aver- 
age of the exchange rates in all the countries in which it does business. This 
means that a basket option, where the option is on a portfolio of currencies, is 
an appropriate tool for hedging. A contract Microsoft likes to negotiate with 
banks is therefore an Asian basket option. The cost of this option is much less 
than a portfolio of put options, one for each month and each exchange rate (see 
Problem 5.23), but it gives Microsoft exactly the protection it wants. 

Microsoft faces other financial risks. For example, it is exposed to interest 
rate risk on its bond portfolio. (When rates rise the portfolio loses money.) It 
also has two sorts of exposure to equity prices. It is exposed to the equity prices 
of the companies in which it invests. It is also exposed to its own equity price 
because it regularly repurchases its own shares as part of its stock awards program. 
It sometimes uses sophisticated option strategies to hedge these risks. 
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BUSINESS SNAPSHOT 5.4 
Procter and Gamble’s Bizarre Deal 


A particularly bizarre swap is the so-called 5/30 swap entered into by Bankers 
Trust (BT) and Procter and Gamble (P&G) on November 2, 1993. This was 
a five-year swap with semiannual payments. The notional principal was $200 
million. BT paid P&G 5.30% per annum. P&G paid BT the average 30-day CP 
(commercial paper) rate minus 75 basis points plus a spread. The average CP rate 
was calculated from observations on the 30-day commercial paper rate each day 
during the preceding accrual period. 

The spread was zero for the first payment date (May 2, 1994). For the remain- 
ing nine payment dates, it was 


98.5 (=a) = Gls TNT Ries 


max | 0, 
100 


In this, five-year CMT is the constant maturity Treasury yield (that is, the yield 
on a five-year Treasury note, as reported by the U.S. Federal Reserve). The 30- 
year TSY price is the midpoint of the bid and offer cash bond prices for the 
6.25% Treasury bond maturing in August 2023. Note that the spread calculated 
from the formula is a decimal interest rate. It is not measured in basis points. If 
the formula gives 0.1 and the average CP rate is 6%, the rate paid by P&G is 6% 
— 0.75% + 10% or 15.25%. 

P&G was hoping that the spread would be zero and the deal would enable it 
to exchange fixed-rate funding at 5.30% for funding at 75 basis points less than 
the commercial paper rate. In fact, interest rates rose sharply in early 1994, bond 
prices fell, and the swap proved very, very expensive. (See Problem 5.38.) 


be set and the activities of traders should be monitored daily to ensure that the risk limits 
are adhered to. We will be discussing this in later chapters. 


Summary 


There are two types of markets in which financial products trade: the exchange-traded 
market and the over-the-counter (OTC) market. The OTC market is undergoing major 
changes as a result of the credit crisis of 2008. These changes have been briefly reviewed 
in this chapter and are discussed in more detail in Chapter 17. 
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BUSINESS SNAPSHOT 5.5 
SocGen’s Big Loss in 2008 


Derivatives are very versatile instruments. They can be used for hedging, specu- 
lation, and arbitrage. One of the risks faced by a company that trades derivatives 
is that an employee who has a mandate to hedge or to look for arbitrage oppor- 
tunities may become a speculator. 

Jérôme Kerviel joined Société Générale (SocGen) in 2000 to work in the 
compliance area. In 2005, he was promoted and became a junior trader in the 
bank’s Delta One products team. He traded equity indices such as the German 
DAX index, the French CAC 40, and the Euro Stoxx 50. His job was to look 
for arbitrage opportunities. These might arise if a futures contract on an equity 
index was trading for a different price on two different exchanges. They might 
also arise if equity index futures prices were not consistent with the prices of the 
shares constituting the index. 

Kerviel used his knowledge of the bank’s procedures to speculate while giv- 
ing the appearance of arbitraging. He took big positions in equity indices and 
created fictitious trades to make it appear that he was hedged. In reality, he had 
large bets on the direction in which the indices would move. The size of his 
unhedged position grew over time to tens of billions of euros. 

In January 2008, his unauthorized trading was uncovered by SocGen. Over 
a three-day period, the bank unwound his position for a loss of 4.9 billion euros. 
This was, at the time, the biggest loss created by fraudulent activity in the history 
of finance. (Later in the year, much bigger losses from Bernard Madoft’s Ponzi 
scheme came to light.) 

Rogue trader losses were not unknown at banks prior to 2008. For example, 
in the 1990s Nick Leeson, who worked at Barings Bank, had a similar man- 
date to Jérôme Kerviel’s. His job was to arbitrage between Nikkei 225 futures 
quotes in Singapore and Osaka. Instead he found a way to make big bets on 
the direction of the Nikkei 225 using futures and options, losing $1 billion and 
destroying the 200-year-old bank. In 2002, it was found that John Rusnak at 
Allied Irish Bank had lost $700 million from unauthorized foreign exchange 
trading. In 2011, Kweku Adoboli,a member of UBS’s Delta One team, lost $2.3 
billion by engaging in activities very similar to those of Jérôme Kerviel. The 
lesson from these losses is that it is important to define unambiguous risk limits 
for traders and then be very careful when monitoring what they do to make sure 
that the limits are adhered to. 
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This chapter has reviewed spot trades, forwards, futures, swaps, and options contracts. 
A forward or futures contract involves an obligation to buy or sell an asset at a certain 
time in the future for a certain price. A swap is an agreement to exchange cash flows in 
the future in amounts dependent on the values of one or more market variables. There 
are two types of options: calls and puts. A call option gives the holder the right to buy 
an asset by a certain date for a certain price. A put option gives the holder the right to 
sell an asset by a certain date for a certain price. 

Forward, futures, and swap contracts have the effect of locking in the prices that 
will apply to future transactions. Options by contrast provide insurance. They ensure 
that the price applicable to a future transaction will not be worse than a certain level. 
Exotic options and structured products are tailored to the particular needs of corporate 
treasurers. For example, as shown in Business Snapshot 5.3, Asian basket options can allow 
a company such as Microsoft to hedge its net exposure to several risks over a period of 
time. Derivatives now trade on a wide variety of variables. This chapter has reviewed 
those that provide payofts dependent on the weather, oil, natural gas, and electricity. It 
has also discussed exotic options and structured products. 
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Practice Questions and Problems 
(Answers at End of Book) 


5.1 What is the difference between a long forward position and a short forward posi- 
tion? 

5.2 Explain carefully the difference between hedging, speculation, and arbitrage. 

5.3 What is the difference between entering into a long forward contract when the 
forward price is $50 and taking a long position in a call option with a strike price 
of $50? 

5.4 Explain carefully the difference between selling a call option and buying a put 
option. 

5.5 An investor enters into a short forward contract to sell 100,000 British pounds for 
US. dollars at an exchange rate of 1.3000 U.S. dollars per pound. How much does 
the investor gain or lose if the exchange rate at the end of the contract is (a) 1.3900 


and (b) 1.3200? 


Trading in Financial Markets 123 


5.6 


5.7 


5.8 


5.9 


5.10 


A trader enters into a short cotton futures contract when the futures price is 50 
cents per pound. The contract is for the delivery of 50,000 pounds. How much 
does the trader gain or lose if the cotton price at the end of the contract is (a) 
48.20 cents per pound and (b) 51.30 cents per pound? 

Suppose you write a put contract with a strike price of $40 and an expiration date 
in three months. The current stock price is $41 and the contract is on 100 shares. 
What have you committed yourself to? How much could you gain or lose? 

What is the difference between the over-the-counter market and the exchange- 
traded market? Which of the two markets do the following trade in: (a) a forward 
contract, (b) a futures contract, (c) an option, (d) a swap, and (e) an exotic option? 
You would like to speculate on a rise in the price of a certain stock. The current 
stock price is $29, and a three-month call with a strike of $30 costs $2.90. You have 
$5,800 to invest. Identify two alternative strategies, one involving an investment 
in the stock and the other involving an investment in the option. What are the 
potential gains and losses from each? 

Suppose that you own 5,000 shares worth $25 each. How can put options be used 
to provide you with insurance against a decline in the value of your holding over 
the next four months? 

When first issued, a stock provides funds for a company. Is the same true of a stock 
option? Discuss. 

Suppose that a March call option to buy a share for $50 costs $2.50 and is held 
until March. Under what circumstances will the holder of the option make a profit? 
Under what circumstances will the option be exercised? 

Suppose that a June put option to sell a share for $60 costs $4 and is held until 
June. Under what circumstances will the seller of the option (i.e., the party with 
the short position) make a profit? Under what circumstances will the option be 
exercised? 

A company knows that it is due to receive a certain amount of a foreign currency 
in four months. What type of option contract is appropriate for hedging? 

A United States company expects to have to pay 1 million Canadian dollars in six 
months. Explain how the exchange rate risk can be hedged using (a) a forward 
contract and (b) an option. 

In the 1980s, Bankers Trust developed index currency option notes (ICONS). 
These are bonds in which the amount received by the holder at maturity varies 
with a foreign exchange rate. One example was its trade with the Long-Term 
Credit Bank of Japan. The ICON specified that if the yen—U.S. dollar exchange 
rate, Sy, is greater than 169 yen per dollar at maturity (in 1995), the holder of the 
bond receives $1,000. If it is less than 169 yen per dollar, the amount received by 
the holder of the bond is 


1,000 — max [o 1,000 (2 - i)| 
Sr 


When the exchange rate is below 84.5, nothing is received by the holder at maturity. 
Show that this ICON is a combination of a regular bond and two options. 
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5.17 


5.18 


5.19 


5.20 


5.21 


5.22 


5.23 


5.24 


5.25 


5.26 


5.27 


5.28 


5.29 


Suppose that USD-sterling spot and forward exchange rates are as follows: 


Spot 1.3080 
90-day forward 1.3056 
180-day forward 1.3018 


What opportunities are open to an arbitrageur in the following situations? 
(a) A 180-day European call option to buy £1 for $1.27 costs 2 cents. 
(b) A 90-day European put option to sell £1 for $1.34 costs 2 cents. 
A company has money invested at 3% for five years. It wishes to use the swap quotes 
in Table 5.5 to convert its investment to a floating-rate investment. Explain how it 
can do this. 
A company has borrowed money for five years at 5%. Explain how it can use the 
quotes in Table 5.5 to convert this to a floating-rate liability. 
A company has a floating-rate liability that costs LIBOR plus 1%. Explain how it 
can use the quotes in Table 5.5 to convert this to a three-year fixed-rate liability. 
A corn farmer argues: “I do not use futures contracts for hedging. My real risk is 
not the price of corn. It is that my whole crop gets wiped out by the weather.” 
Discuss this viewpoint. Should the farmer estimate his or her expected production 
of corn and hedge to try to lock in a price for expected production? 
An airline executive has argued: “There is no point in our hedging the price of jet 
fuel. There is just as much chance that we will lose from doing this as that we will 
gain.” Discuss the executive’s viewpoint. 
Why is the cost of an Asian basket put option to Microsoft considerably less than 
the cost of a portfolio of put options, one for each currency and each maturity (see 
Business Snapshot 5.3)? 
“Oil, gas, and electricity prices tend to exhibit mean reversion.” What do you think 
is meant by this statement? Which energy source is likely to have the highest rate 
of mean reversion? Which is likely to have the lowest? 
Does a knock-out barrier call option become more or less valuable as the frequency 
with which the barrier is observed is increased? 
Suppose that each day during July the minimum temperature is 68° Fahrenheit and 
the maximum temperature is 82° Fahrenheit. What is the payoff from a call option 
on the cumulative CDD during July with a strike of 250 and a payment rate of 
$5,000 per degree day? 
Explain how a 5 X8 option contract on electricity for May 2019 with daily exer- 
cise works. Explain how a 5X 8 option contract on electricity for May 2019 with 
monthly exercise works. Which is worth more? 
A U.S. investor writes five call option contracts (i.e., options to buy 500 shares). 
The option price is $3.50, the strike price is $60, and the stock price is $57. What 
is the initial margin requirement? 
A trader shorts 500 shares of a stock when the price is $50. The initial margin is 
160% and the maintenance margin is 130%. How much margin is required from 
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the investor initially? How high does the price of the stock have to rise for there 
to be a margin call? 

What is the difference between the margin required by an exchange from one of 
its members for a future contract and the margin required by a broker from one of 
its clients? 


Further Questions 


A company enters into a short futures contract to sell 5,000 bushels of wheat for 
250 cents per bushel. The initial margin is $3,000 and the maintenance margin is 
$2,000. What price change would lead to a margin call? Under what circumstances 
could $1,500 be withdrawn from the margin account? 

A trader buys 200 shares of a stock on margin. The price of the stock is $20 per 
share. The initial margin is 60% and the maintenance margin is 30%. How much 
money does the trader have to provide initially? For what share price is there a 
margin call? 

The current price of a stock is $94, and three-month European call options with a 
strike price of $95 currently sell for $4.70. An investor who feels that the price of 
the stock will increase is trying to decide between buying 100 shares and buying 
2,000 call options (= 20 contracts). Both strategies involve an investment of $9,400. 
What advice would you give? How high does the stock price have to rise for the 
option strategy to be more profitable? 

A bond issued by Standard Oil worked as follows. The holder received no interest. 
At the bond’s maturity the company promised to pay $1,000 plus an additional 
amount based on the price of oil at that time. The additional amount was equal to 
the product of 170 and the excess (if any) of the price of a barrel of oil at maturity 
over $25. The maximum additional amount paid was $2,550 (which corresponds 
to a price of $40 per barrel). Show that the bond is a combination of a regular 
bond, a long position in call options on oil with a strike price of $25, and a short 
position in call options on oil with a strike price of $40. 

The price of gold is currently $1,500 per ounce. The forward price for delivery 
in one year is $1,700. An arbitrageur can borrow money at 5% per annum. What 
should the arbitrageur do? Assume that the cost of storing gold is zero and that 
gold provides no income. 

A company’s investments earn LIBOR minus 0.5%. Explain how it can use the 
quotes in Table 5.5 to convert them to (a) three-, (b) five-, and (c) ten-year fixed- 
rate investments. 

What position is equivalent to a long forward contract to buy an asset at K on a 
certain date and a long position in a European put option to sell it for K on that 
date? 

Estimate the interest rate paid by P&G on the 5/30 swap in Business Snapshot 5.4 if 
(a) the CP rate is 6.5% and the Treasury yield curve is flat at 6% and (b) the CP rate 
is 7.5% and the Treasury yield curve is flat at 7% with semiannual compounding. 


Chapter 6 


The Credit Crisis of 
2007—2008 


his chapter has two objectives. The first is to examine the origins of the 2007— 
2008 financial crisis, what went wrong, and the lessons that can be learned. The 
second is to explain how securitization works. 

Starting in 2007, the United States experienced the worst financial crisis since the 
1930s. The crisis spread rapidly from the United States to other countries and from 
financial markets to the real economy. Some financial institutions failed. Many more had 
to be bailed out by national governments. There can be no question that the first decade 
of the twenty-first century was disastrous for the financial sector. Risk management has 
now assumed a much greater importance in financial institutions. Also, as we shall see in 
later chapters, the crisis has led to a major overhaul of the way financial institutions are 
regulated. 

Chapter 5 covered futures, forwards, swaps, and options. These are some of the ways 
in which risks can be transferred from one entity to another in the economy. A further 
important way of transferring risk is securitization. This chapter explains asset-backed 
securities (ABSs) and collateralized debt obligations (CDOs) and discusses their role in 
the crisis. 
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6.1 The U.S. Housing Market 


A natural starting point for a discussion of the credit crisis of 2007—2008 is the US. 
housing market. Figure 6.1 shows the S&P/Case-Shiller composite-10 index for house 
prices in the United States between January 1987 and March 2017. This tracks house 
prices for ten major metropolitan areas in the United States. In about the year 2000, 
house prices started to rise much faster than they had in the previous decade. The very 
low level of interest rates between 2002 and 2005 was an important contributory factor, 
but the bubble in house prices was largely fueled by mortgage lending practices. 

The 2000 to 2006 period was characterized by a huge increase in what is termed 
subprime mortgage lending. Subprime mortgages are mortgages that are considered to be 
significantly more risky than average. Before 2000, most mortgages classified as subprime 
were second mortgages. After 2000, this changed as financial institutions became more 
comfortable with the notion of a subprime first mortgage. 


6.1.1 The Relaxation of Lending Standards 


Mortgage lenders in the United States started to relax their lending standards in about 
2000. This made house purchases possible for many families that had previously been 
considered to be not sufficiently creditworthy to qualify for a mortgage. These families 
increased the demand for real estate and prices rose. To mortgage brokers and mortgage 
lenders, the combination of more lending and rising house prices was attractive. More 
lending meant bigger profits. Rising house prices meant that the lending was well covered 
by the underlying collateral. If the borrower defaulted, the resulting foreclosure would 
lead to little or no loss. 

How could mortgage brokers and mortgage lenders keep increasing their profits? 
Their problem was that, as house prices rose, it was more difficult for first-time buyers to 
afford a house. In order to continue to attract new entrants to the housing market, they 
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Figure 6.1 The S&P/Case-Shiller Composite-10 Index of U.S. Real Estate Prices (not seasonally 
adjusted), 1987 to March 2017 
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had to find ways to relax their lending standards even more—and this is exactly what 
they did. The amount lent as a percentage of the house price increased. Adjustable rate 
mortgages (ARMs) were developed where there was a low teaser rate of interest that 
would last for two or three years and be followed by a rate that was liable to be much 
higher.’ Lenders also became more cavalier in the way they reviewed mortgage applica- 
tions. Indeed, the applicant’s income and other information reported on the application 
were frequently not checked. 

Why was the government not regulating the behavior of mortgage lenders? The 
answer is that the U.S. government had, since the 1990s, been trying to expand home 
ownership, and had been applying pressure to mortgage lenders to increase loans to low- 
and moderate-income households. Some state legislators (such as those in Ohio and 
Georgia) were concerned about what was going on and wanted to curtail predatory 
lending.” However, the courts decided that national standards should prevail. 

A number of terms have been used to describe mortgage lending during the period 
leading up to the credit crisis. One is “liar loans,” because individuals applying for a 
mortgage, knowing that no checks would be carried out, sometimes chose to lie on 
the application form. Another term used to describe some borrowers is “NINJA” (no 
income, no job, no assets). Some analysts realized that the mortgages were risky, but 
pricing in the market for securities created from the mortgages suggests that the full 
extent of the risks and their potential impact on markets was not appreciated until well 
into 2007. 

Mian and Sufi (2009) have carried out research confirming that there was a relaxation 
of the criteria used for mortgage lending? Their research defines “high denial zip codes” 
as zip codes where a high proportion of mortgage applicants had been turned down in 
1996, and shows that mortgage origination grew particularly fast for these zip codes 
between 2000 to 2007. (Zip codes are postal codes in the United States defining the area 
in which a person lives.) Moreover, their research shows that lending criteria were relaxed 
progressively through time rather than all at once because originations in high-denial zip 
codes were an increasing function of time during the 2000 to 2007 period. Zimmerman 
(2007) provides some confirmation of this He shows that subsequent default experi- 
ence indicates that mortgages made in 2006 were of a lower quality than those made 
in 2005 and these were in turn of lower quality than mortgages made in 2004. Stan- 
dard & Poor’s has estimated that subprime mortgage origination in 2006 alone totaled 


1A “2/28” ARM, for example, is an ARM where the rate is fixed for two years and then floats for 
the remaining 28 years. If real estate prices increased, lenders expected the borrowers to prepay and 
take out a new mortgage at the end of the teaser rate period. However, prepayment penalties, often 
zero on prime mortgages, were quite high on subprime mortgages. 

? Predatory lending describes the situation where a lender deceptively convinces borrowers to agree 
to unfair and abusive loan terms. 

See A. Mian and A. Sufi, “The Consequences of Mortgage Credit Expansion: Evidence from the US 
Mortgage Default Crisis,’ Quarterly Journal of Economics 124, no. 4 (November 2009): 1449-1496. 
+See T. Zimmerman, “The Great Subprime Meltdown,” Journal of Structured Finance (Fall 2007): 7-20. 
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$421 billion. AMP Capital Investors estimated that there was a total of $1.4 trillion of 
subprime mortgages outstanding in July 2007. 


6.1.2 The Bubble Bursts 


The result of the relaxation of lending standards was an increase in the demand for 
houses and a bubble in house prices. Prices increased very fast during the 2000 to 2006 
period. All bubbles burst eventually and this one was no exception. In the second half of 
2006, house prices started to edge down. One reason was that, as house prices increased, 
demand for houses declined. Another was that some borrowers with teaser rates found 
that they could no longer afford their mortgages when the teaser rates ended. This led 
to foreclosures and an increase in the supply of houses for sale. The decline in house 
prices fed on itself. Individuals who had borrowed 100%, or close to 100%, of the cost 
of a house found that they had negative equity (i.e., the amount owed on the mortgage 
was greater than the value of the house). Some of these individuals chose to default. This 
led to more foreclosures, a further increase in the supply of houses for sale, and a further 
decline in house prices. 

One of the features of the U.S. housing market is that mortgages are non-recourse in 
some states. This means that, when there is a default, the lender is able to take possession 
of the house, but other assets of the borrower are off-limits? Consequently, the borrower 
has a free American-style put option. He or she can at any time sell the house to the lender 
for the principal outstanding on the mortgage. (During the teaser-interest-rate period 
this principal typically increased, making the option more valuable.) Market participants 
realized belatedly how costly the put option could be. If the borrower had negative 
equity, the optimal decision was to exchange the house for the outstanding principal 
on the mortgage. The house was then sold, adding to the downward pressure on house 
prices. 

It would be a mistake to assume that all mortgage defaulters were in the same position. 
Some were unable to meet mortgage payments and suffered greatly when they had to 
give up their homes. But many of the defaulters were speculators who bought multiple 
homes as rental properties and chose to exercise their put options. It was their tenants 
who suffered. There are also reports that some house owners (who were not speculators) 
were quite creative in extracting value from their put options. After handing the keys to 
their house to the lender, they turned around and bought (sometimes at a bargain price) 
another house that was in foreclosure. Imagine two people owning identical houses next 
to each other. Both have mortgages of $250,000. Both houses are worth $200,000 and in 
foreclosure can be expected to sell for $170,000. What is the owners’ optimal strategy? 
The answer is that each person should exercise the put option and buy the neighbor’s 
house. 


>In some other states, mortgages are not non-recourse but there is legislation making it difficult for 
lenders to take possession of other assets besides the house. 
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As foreclosures increased, the losses on mortgages also increased. Losses were high 
because houses in foreclosure were often surrounded by other houses that were also for 
sale. They were sometimes in poor condition. In addition, banks faced legal and other 
fees. In normal market conditions, a lender can expect to recover 75% of the amount 
owed in a foreclosure. In 2008 and 2009, recovery rates as low as 25% were experienced 
in some areas. 

The United States was not alone in having declining real estate prices. Prices declined 
in many other countries as well. Real estate in the United Kingdom was particularly badly 
affected. As Figure 6.1 indicates, average house prices recovered almost to their previous 
peak in the United States between mid-2012 and March 2017. 


6.2 Securitization 


The originators of mortgages did not in many cases keep the mortgages themselves. They 
sold portfolios of mortgages to companies that created products for investors from them. 
This process is known as securitization. Securitization has been an important and useful 
tool for transferring risk in financial markets for many years. It underlies the originate- 
to-distribute model that was widely used by banks prior to 2007 and is discussed in 
Chapter 2. 

Securitization played a part in the creation of the housing bubble. The behavior 
of mortgage originators was influenced by their knowledge that mortgages would be 
securitized.° When considering new mortgage applications, the question was not: “Is 
this a credit we want to assume?” Instead it was: “Is this a mortgage we can make money 
from by selling it to someone else?” 

When mortgages were securitized, the only information received about the mort- 
gages by the buyers of the products that were created from them was the loan-to-value 
ratio (i.e., the ratio of the size of the loan to the assessed value of the house) and the 
borrower’s FICO (credit) score.’ The reason why lenders did not check information on 
things such as the applicant’s income, the number of years the applicant had lived at his 
or her current address, and so on, was that this information was considered irrelevant. 
The most important thing for the lender was whether the mortgage could be sold to 
others—and this depended primarily on the loan-to-value ratio and the applicant’s FICO 
score. 

It is interesting to note in passing that both the loan-to-value ratio and the FICO 
score were of dubious quality. The property assessors who determined the value of a 
house at the time of a mortgage application sometimes inflated valuations because they 


© Research by Keys et al. shows that there was a link between securitization and the lax screening of 
mortgages. See B.J. Keys, T. Mukherjee, A. Seru, and V. Vig, “Did Securitization Lead to Lax Screening? 
Evidence from Subprime Loans,” Quarterly Journal of Economics 125, no. 1 (February 2010): 307—362. 

7FICO is a credit score developed by the Fair Isaac Corporation and is widely used in the United 
States. It ranges from 300 to 850. 


132 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


ABS 
Asset 1 Senior tranche 
Asset 2 Principal: $75 million 
Asset 3 Return = 6% 
Mezzanine tranche 
SPV > Principal: $20 million 

Return = 10% 
Asset n 

Equity tranche 
Principal: Principal: $5 million 
$100 million Return = 30% 


Figure 6.2 Creation of an Asset-Backed Security from a Portfolio of Assets (simplified) 


knew that the lender wanted a low loan-to-value ratio. Potential borrowers were some- 
times counseled to take certain actions that would improve their FICO scores’ 
We now consider the products that were created from the mortgages and sold in the 


market. 


6.2.1 Asset-Backed Securities 


An asset-backed security (ABS) is a security created from the cash flows of financial assets 
such as loans, bonds, credit card receivables, mortgages, auto loans, and aircraft leases. 
Sometimes, cash flow streams such as royalties from the future sales of a piece of music 
are even used. The way the security works is illustrated in Figure 6.2. A portfolio of assets 
(such as subprime mortgages) is sold by the originators of the assets to a special purpose 
vehicle (SPV) and the cash flows from the assets are allocated to tranches. In Figure 6.2, 
there are three tranches. (This is a simplification. In reality there are usually many more 
than three tranches created.) These are the senior tranche, the mezzanine tranche, and the 
equity tranche. The portfolio has a principal of $100 million. This is divided as follows: 
$75 million to the senior tranche, $20 million to the mezzanine tranche, and $5 million 
to the equity tranche. The senior tranche is promised a return of 6%, the mezzanine 
tranche is promised a return of 10%, and the equity tranche is promised a return of 30%. 

It sounds as though the equity tranche has the best deal, but this is not necessarily 
the case. The equity tranche is much less likely to realize its return than the other two 
tranches. Cash flows are allocated to tranches by specifying what is known as a water- 
fall. An approximation to the way a waterfall works is in Figure 6.3. There is a separate 


8 One such action might be to make regular payments on a credit card for a few months. 
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Figure 6.3 The Waterfall in an Asset-Backed Security 


waterfall for interest and principal cash flows. Interest cash flows from the assets are allo- 
cated to the senior tranche until the senior tranche has received its promised return on 
its outstanding principal. Assuming that the promised return to the senior tranche can be 
made, cash flows are then allocated to the mezzanine tranche. If the promised return to 
the mezzanine tranche on its outstanding principal can be made and interest cash flows 
are left over, they are allocated to the equity tranche. Principal cash flows are used first 
to repay the principal of the senior tranche, then the mezzanine tranche, and finally the 
equity tranche? 

The structure in Figure 6.2 typically lasts several years. The extent to which the 
tranches get their principal back depends on losses on the underlying assets. The first 5% 
of losses are borne by the principal of the equity tranche. If losses exceed 5%, the equity 
tranche loses all its principal and some losses are borne by the principal of the mezzanine 
tranche. If losses exceed 25%, the mezzanine tranche loses all its principal and some losses 
are borne by the principal of the senior tranche. 

There are therefore two ways of looking at an ABS. One is with reference to the 
waterfall in Figure 6.3. Cash flows go first to the senior tranche, then to the mezzanine 
tranche, and then to the equity tranche. The other is in terms of losses. Losses of principal 
are first borne by the equity tranche, then by the mezzanine tranche, and then by the 
senior tranche. 

The ABS is designed so that the senior tranche is rated AAA. The mezzanine tranche 
is typically rated BBB. The equity tranche is typically unrated. Unlike the ratings assigned 
to bonds, the ratings assigned to the tranches of an ABS are what might be termed 
“negotiated ratings.” The objective of the creator of the ABS is to make the senior tranche 
as big as possible without losing its AAA credit rating. (This maximizes the profitability 
of the structure.) The ABS creator examines information published by rating agencies 
on how tranches are rated and may present several structures to rating agencies for a 


? The priority rule described here is a simplification. The precise waterfall rules are somewhat more 
complicated and outlined in a legal document several hundred pages long. 
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preliminary evaluation before choosing the final one. The creator of the ABS expects 
to make a profit because the weighted average return on the assets in the underlying 
portfolio is greater than the weighted average return offered to the tranches. 

A particular type of ABS is a collateralized debt obligation (CDO). This is an ABS where 
the underlying assets are fixed-income securities. The procedures used by the market to 
value a CDO are outlined in Appendix L. 


6.2.2 ABS CDOs 


Finding investors to buy the senior AAA-rated tranches created from subprime mort- 
gages was not difficult. Equity tranches were typically retained by the originator of the 
mortgages or sold to a hedge fund. Finding investors for the mezzanine tranches was 
more difficult. This led financial engineers to be creative (arguably too creative). Finan- 
cial engineers created an ABS from the mezzanine tranches of ABSs that were created 
from subprime mortgages. This is known as an ABS CDO or Mezz ABS CDO and 
is illustrated in Figure 6.4. (Like the ABS in Figure 6.3, this is simplified.) The senior 
tranche of the ABS CDO is rated AAA. This means that the total of the AAA-rated 
instruments created in the example that is considered here is 90% (75% plus 75% of 
20%) of the principal of the underlying mortgage portfolios. This seems high but, if 
the securitization were carried further with an ABS being created from the mezzanine 
tranches of ABS CDOs (and this did happen), the percentage would be pushed even 
higher. 

In the example in Figure 6.4, the AAA-rated tranche of the ABS would probably be 
downgraded in the second half of 2007. However, it would receive the promised return 
if losses on the underlying mortgage portfolios were less than 25% because all losses of 
principal would then be absorbed by the more junior tranches. The AAA-rated tranche 
of the ABS CDO in Figure 6.4 is much more risky. It will get paid the promised return 
if losses on the underlying portfolios are 10% or less because in that case mezzanine 
tranches of ABSs have to absorb losses equal to 5% of the ABS principal or less. As they 
have a total principal of 20% of the ABS principal, their loss is at most 5/20 or 25%. At 
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Figure 6.4 Creation of ABSs and an ABS CDO from Subprime Mortgages (simplified) 
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Table 6.1 Losses to Tranches in Figure 6.4 


Losses to Losses to Losses to Losses to Losses to 
Subprime Mezzanine Equity Tranche Mezzanine Tranche Senior Tranche 
Portfolios Tranche of ABS of ABS CDO of ABS CDO of ABS CDO 
10% 25% 100% 100% 0% 

15% 50% 100% 100% 33% 
20% 75% 100% 100% 67% 

25% 100% 100% 100% 100% 


worst this wipes out the equity tranche and mezzanine tranche of the ABS CDO but 
leaves the senior tranche unscathed. 

The senior tranche of the ABS CDO suffers losses if losses on the underlying portfo- 
lios are more than 10%. Consider, for example, the situation where losses are 20% on the 
underlying portfolios. In this case, losses on the mezzanine tranches of the ABS CDO are 
15/20 or 75% of their principal. The first 25% is absorbed by the equity and mezzanine 
tranches of the ABS CDO. The senior tranche of the ABS CDO therefore loses 50/75 
or 67% of its value. These and other results are summarized in Table 6.1. 

Many banks have lost money investing in the senior tranches of ABS CDOs. The 
investments typically promised a return quite a bit higher than the bank’s funding cost. 
Because they were rated AAA, the capital requirements were minimal. Merrill Lynch is 
an example ofa bank that lost a great deal of money from investments in ABS CDOs. In 
July 2008, Merrill Lynch agreed to sell senior tranches of ABS CDOs that had previously 
been rated AAA and had a principal of $30.6 billion to Lone Star Funds for 22 cents on 
the dollar.!® 


6.2.3 CDOs and ABS CDOs in Practice 


Figures 6.2 and 6.4 illustrate the nature of the securitizations that were done. In practice, 
many more tranches were created than those shown in Figures 6.2 and 6.4, and many of 
the tranches were thinner (i.e., corresponded to a narrower range of losses). Figure 6.5 
shows a more realistic example of the structures that were created. This is adapted from 
an illustration by Gorton, which was taken from an article by UBS.!! 

In Figure 6.5, two ABS CDOs are created. One (referred to as a Mezz ABS CDO) is 
created from the BBB-rated tranches of ABSs (similarly to the ABS CDO in Figure 6.4); 
the other (referred to as a high-grade ABS CDO) is from the AAA, AA, and A tranches of 
ABSs. The figure shows a third level of securitization based on the A and AA tranches of 
the Mezz ABS CDO. There was typically a small amount of overcollateralization with the 
face value of the mortgages being greater (by 1% or 2%) than the total face value of the 


10 In fact the deal was worse than it sounds for Merrill Lynch because Merrill Lynch agreed to finance 
75% of the purchase price. If the value of the tranches fell below 16.5 cents on the dollar, Merrill 
Lynch might have found itself owning the assets again. 

11G, Gorton, “The Subprime Panic,” European Financial Management 15, no. 1 (2008): 10-46. 
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Figure 6.5 More Realistic Example of Subprime Securitizations with ABS, ABS CDOs, and a 
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ABS tranches. This created a cushion for investors, but by carrying out a similar analysis 
to that in Table 6.1 it is not difficult to see that investors in many of the tranches created 
would lose principal in situations where losses on the underlying subprime mortgage 
portfolios were moderately high. 

The risks in the AAA-rated tranches of ABSs and ABS CDOs were higher than 
either investors or rating agencies realized. One of the reasons for this involves corre- 
lation. The values of the tranches of ABSs depend on the default correlation of the 
underlying mortgages. The tranches of ABS CDOs are even more heavily dependent on 
these default correlations. If mortgages exhibit a fairly low default correlation (as they do 
in normal times), there is very little chance of a high overall default rate and the AAA- 
rated tranches of both ABSs and ABS CDOs are safe. But, many analysts overlooked the 
fact that correlations always increase in stressed market conditions. In 2005 to 2006, the 
models used by investors and rating agencies assumed correlations that were too low for 
the upheavals in the U.S. housing market that were considered likely by many observers. 
As explained in Business Snapshot 6.1, another mistake made by analysts was to assume 
that the BBB-rated tranches of an ABS were equivalent in risk to BBB-rated bonds. 
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BUSINESS SNAPSHOT 6.1 
All BBBs Are Not the Same 


Analysts tended to assume that the mezzanine tranche of an ABS, when rated 
BBB, can be considered to be identical to a BBB bond for the purposes of 
evaluating a CDO created from the mezzanine tranches. This is not a good 
assumption. The rating agency models attempted to ensure that the BBB tranche 
of an ABS had the same probability of loss, or the same expected loss, as a BBB 
bond. But the probability distribution of the loss is very different. Because the 
BBB tranches of ABSs were thin, it is much more likely that an investor in the 
BBB tranche of an ABS will lose everything than that this will happen for an 
investor in a BBB-rated bond. (This is sometimes referred to as “cliff risk.’) 
This means that the risk characteristics of ABS CDO tranches created from the 
BBB tranches of ABSs are quite different from the risk characteristics of similar 
tranches created from BBB bonds. 

One lesson from this is that it is dangerous to interpret ratings for tranches 
of an ABS—or any other structured product—in the same way that ratings for 
bonds are interpreted. For similarly rated bonds and structured products, the 
probability distributions of losses are markedly different. 


There are important differences between the two and these differences can have a big 
effect on the valuation of the tranches of ABS CDOs. 


6.3 The Losses 


The defaults on mortgages in the United States had a number of consequences. Financial 
institutions and other investors who had bought the tranches of ABSs and ABS CDOs 
lost money. Losses were also incurred by some mortgage originators because they had 
provided guarantees as to the quality of the mortgages that were securitized and because 
they faced lawsuits over their lending practices. 

As often happens when losses are experienced in one segment of the debt market, 
there was a “flight to quality.” Investors became reluctant to take any credit risk and 
preferred to buy Treasury instruments and similarly safe investments. Credit spreads (the 
extra return required for taking credit risks) increased sharply. It was difficult for many 
non-financial companies to obtain loans from banks. Indeed, banks became reluctant to 
lend to each other at all and interbank lending rates increased sharply. 

The tranches of ABSs and ABS CDOs were downgraded by rating agencies in the 
second half of 2007. The market for these tranches became very illiquid. Investors realized 
that they did not understand the tranches as well as they had previously thought and that 
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they had placed too much reliance on credit ratings. This emphasizes the importance 
of transparency in financial markets. The products created during the period leading up 
to the crisis were very complicated? Investors did not worry about this until problems 
emerged. They then found that the liquidity of the market was such that they could only 
trade at fire-sale prices. 

Banks such as Citigroup, UBS, and Merrill Lynch suffered huge losses. There were 
many government bailouts of financial institutions. Lehman Brothers was allowed to fail. 
The world experienced the worst recession since the 1930s. Unemployment increased. 
Even people in remote parts of the world that had no connection with U.S. financial 
institutions were affected. 

Banks are now paying a price for the crisis. As we shall see in Chapter 16, they are 
required to keep more capital than before. They are also required to maintain certain liq- 
uidity ratios. Legislation such as Dodd—Frank in the United States increases the oversight 
of financial institutions and restricts their activities in areas such as proprietary trading 
and derivatives trading. 


6.4 What Went Wrong? 


“Irrational exuberance” is a phrase coined by Alan Greenspan, Chairman of the Federal 
Reserve Board, to describe the behavior of investors during the bull market of the 1990s. 
It can also be applied to the period leading up to the credit crisis. Mortgage lenders, 
the investors in tranches of ABSs and ABS CDOs that were created from residential 
mortgages, and the companies that sold protection on the tranches assumed that the 
U.S. house prices would continue to increase—or at least not decrease. There might 
be declines in one or two areas, but the possibility of the widespread decline shown in 
Figure 6.1 was a scenario not considered by most people. 

Many factors contributed to the crisis that started in 2007. Mortgage originators 
used lax lending standards. Products were developed to enable mortgage originators to 
profitably transfer credit risk to investors. Rating agencies moved from their traditional 
business of rating bonds, where they had a great deal of experience, to rating structured 
products, which were relatively new and for which there were relatively little historical 
data. The products bought by investors were complex and in many instances investors 
and rating agencies had inaccurate or incomplete information about the quality of the 
underlying assets. Investors in the structured products that were created thought they 
had found a money machine and chose to rely on rating agencies rather than forming 
their own opinions about the underlying risks. The return promised on the structured 
products rated AAA was high compared with the returns promised on bonds rated AAA. 


'2Some of the products that were created were even more complicated than indicated by the descrip- 
tion in Section 6.2. For example, sometimes ABS CDO tranches were included in the portfolios used 
to create other ABS CDOs. 
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6.4.1 Regulatory Arbitrage 


Many of the mortgages were originated by banks and it was banks that were the main 
investors in the tranches that were created from the mortgages. Why would banks 
choose to securitize mortgages and then buy the securitized products that were cre- 
ated? The answer concerns what is termed regulatory arbitrage. The regulatory capital 
banks were required to keep for the tranches created from a portfolio of mortgages was 
less than the regulatory capital that would be required for the mortgages themselves. 
This is because the mortgages were kept in what is referred to as the “banking book,” 
whereas the tranches were kept in what is referred to as the “trading book.” Capital 
requirements were different for the banking book and the trading book. We will discuss 
this point further in Chapters 15 to 18. 


6.4.2 Incentives 


Economists use the term “agency costs” to describe the situation where incentives are 
such that the interests of two parties in a business relationship are not perfectly aligned. 
The process by which mortgages were originated, securitized, and sold to investors was 
unfortunately riddled with agency costs. 

The incentive of the originators of mortgages was to make loans that would 
be acceptable to the creators of the ABS and ABS CDO tranches. The incentive of the 
individuals who valued houses on which the mortgages were written was to please the 
lender by providing as high a valuation as possible so that the loan-to-value ratio was as 
low as possible. (Pleasing the lender was likely to lead to more business from that lender.) 
The main concern of the creators of ABSs and ABS CDOs was the profitability of the 
structures (i.e., the excess of the weighted average inflows over the weighted average out- 
flows). They wanted the volume of AAA-rated tranches that they created to be as high 
as possible and found ways of using the published criteria of rating agencies to achieve 
this. The rating agencies were paid by the issuers of the securities they rated and about 
half their income came from structured products. 

Another source of agency costs concerns financial institutions and their employees. 
Employee compensation falls into three categories: regular salary, the end-of-year bonus, 
and stock or stock options. Many employees at all levels of seniority in financial insti- 
tutions, particularly traders, receive much of their compensation in the form of end-of- 
year bonuses. Traditionally, this form of compensation has focused employee attention on 
short-term performance. If an employee generates huge profits one year and is respon- 
sible for severe losses the next year, the employee will receive a big bonus the first year 
and will not have to return it the following year. The employee might lose his or her job 
as a result of the second-year losses, but even that is not a disaster. Financial institutions 
seem to be surprisingly willing to recruit individuals with losses on their resumes. 

Imagine you are an employee of a financial institution investing in ABS CDOs in 
2006. Almost certainly you would have recognized that there was a bubble in the USS. 
housing market and would expect that bubble to burst sooner or later. However, it is 
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possible that you would decide to continue with your ABS CDO investments. If the 
bubble did not burst until after December 31, 2006, you would still get a nice bonus at 
the end of 2006! 


6.5 Lessons from the Crisis 


Some of the lessons for risk managers from the crisis are as follows: 


1. Risk managers should be watching for situations where there is irrational exuberance 
and make sure that senior management recognize that the good times will not last 
forever. 

2. Correlations always increase in stressed markets. In considering how bad things might 
get, risk managers should not use correlations that are estimated from data collected 
during normal market conditions. 

3. Recovery rates decline when default rates increase. This is true for almost all debt 
instruments, not just mortgages. (See Section 19.3.) In considering how bad things 
might get, risk managers should not use recovery rates that are estimated from data 
collected during normal market conditions. 

4. Risk managers should ensure that the incentives of traders and other personnel 
encourage them to make decisions that are in the interests of the organization they 
work for. Many financial institutions have revised their compensation policies as a 
result of the crisis. Bonuses are now often spread out over several years rather than all 
being paid at once. If good performance in one year is followed by bad performance 
in the next, part of the bonus for the good-performance year that has not yet been 
paid may be clawed back. 

5. If a deal seems too good to be true, it probably is. AAA-rated tranches of structured 
products promised returns that were higher than the returns promised on AAA bonds 
by 100 basis points or more. A sensible conclusion from this for an investor would 
be that further analysis is needed because there are likely to be risks in the tranches 
that are not considered by rating agencies. 

6. Investors should not rely on ratings. They should understand the assumptions made 
by rating agencies and carry out their own analyses. 

7. Transparency is important in financial markets. If there is a lack of transparency (as 
there was for ABS CDOs), markets are liable to dry up when there is negative news. 

8. Re-securitization, which led to the creation of ABS CDOs and CDOs of CDOs, was 
a badly flawed idea. The assets used to create ABSs in the first leg of the securitization 
should be as well diversified as possible. There is then nothing to be gained from 
further securitization. 


Business Snapshot 6.1 makes the point that many market participants incorrectly 
considered ABS tranches rated BBB to be equivalent to BBB bonds. Business Snap- 
shot 6.2 suggests a trading strategy that could be followed by people who realized that 
this was not so. 
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BUSINESS SNAPSHOT 6.2 
A Trading Opportunity? 


A few traders made a huge amount of money betting against the subprime mort- 
gage market. Suppose that you are analyzing markets in 2005 and 2006, but are 
uncertain about how subprime mortgages will perform. Is there a trading oppor- 
tunity open to you? 

The answer is that Mezz ABS CDOs do present a trading opportunity. Fig- 
ure 6.5 is a simplification of how tranches were actually created. In practice, 
there were usually three ABS tranches, rated BBB+, BBB, and BBB-. Each was 
very thin—about 1% wide. Separate Mezz ABS CDOs were created from each 
of the three types of tranches. Consider the Mezz ABS CDO created from BBB+ 
tranches. After analyzing the mortgages in the pools used to create the BBB+ 
tranches, a trader would conclude that the pools had similar risk profiles and that 
there would be a tendency for either all tranches to be safe or all to be wiped out. 
(Because the tranches are only 1% wide, it is unlikely that they would be only 
partially wiped out.) This means that all the Mezz ABS CDO tranches created 
from ABS tranches rated BBB+ are either safe or wiped out. The Mezz ABS 
CDO tranches are therefore much the same as each other and should have the 
same rating (BBB+ in the case we are considering). 

Having recognized this, what should the trader do? He or she should buy 
junior ABS CDO tranches (which are inexpensive because of their rating) and 
short senior ABS CDO tranches (which are relatively expensive). If the underly- 
ing principal is the same for both trades, the trader can then relax knowing that 
a profit has been locked in. 

This emphasizes the point in Business Snapshot 6.1 that BBB tranches (par- 
ticularly very thin BBB tranches) should not be considered equivalent to BBB 
bonds when a securitization is planned. 


Summary 


The credit crisis starting in 2007 had a devastating effect on financial markets throughout 
the world. Its origins can be found in the U.S. housing market. The U.S. government was 
keen to encourage home ownership. Interest rates were low. Mortgage brokers and mort- 
gage lenders found it attractive to do more business by relaxing their lending standards. 
Products for securitizing mortgages had been developed so that the investors bearing the 
credit risk were not necessarily the same as the original lenders. Rating agencies were 
prepared to give an AAA rating to senior tranches that were created by securitization. 
There was no shortage of buyers for these AAA-rated tranches because their yields were 
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higher than the yields on AAA-rated bonds. The compensation arrangements in banks 
focused their employees’ attention on short-term profits, and as a result they chose to 
ignore the housing bubble and its potential impact on some very complex products they 
were trading. 

House prices rose as both first-time buyers and speculators entered the market. Some 
mortgages had included a low “teaser rate” for two or three years. After the teaser rate 
ended, some borrowers faced higher interest rates that they could not afford and had no 
choice but to default. This led to foreclosures and an increase in the supply of houses 
being sold. The price increases between 2000 and 2006 began to be reversed. Speculators 
and others who found that the amount owed on their mortgages was greater than the 
value of their houses (i.e., they had negative equity) defaulted. This accentuated the price 
decline. 

Many factors played a part in creating the U.S. housing bubble and resulting recession. 
These include irrational exuberance on the part of market participants, poor incentives, 
too much reliance on rating agencies, not enough analysis by investors, and the complex- 
ity of the products that were created. The crisis has provided a number of lessons for risk 
managers. As we will see later in this book, it has also led to a major overhaul of bank 
regulation and bank legislation. 
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Practice Questions and Problems 
(Answers at End of Book) 


6.1 Why did mortgage lenders frequently not check on information in the mortgage 
application during the 2000 to 2007 period? 

6.2 Why do you think the increase in house prices during the 2000 to 2007 period is 
referred to as a bubble? 

6.3 What are the numbers in Table 6.1 for a loss rate of (a) 5% and (b) 12%? 
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6.4 


6.5 
6.6 
6.7 
6.8 
6.9 
6.10 


6.11 
6.12 


6.13 


6.14 


6.15 


In what ways are the risks in the tranche of an ABS different from the risks in a 
similarly rated bond? 

Explain the difference between (a) an ABS and (b) an ABS CDO. 

How were the risks in ABS CDOs misjudged by the market? 

What is meant by the term “agency costs”? 

What is a waterfall in a securitization? 

How is an ABS CDO created? What was the motivation to create ABS CDOs? 
How did Mian and Sufi show that mortgage lenders relaxed their lending criteria 
during the 2000 to 2006 period? 

What is a mezzanine tranche? 

Explain the influence of an increase in default correlation on (a) the risks in the 
equity tranche of an ABS and (b) the risks in the senior tranches of an ABS. 
Explain why the end-of-year bonus has, in the past, been regarded as providing 
incentives for employees to think only about the short term. 


Further Questions 


Suppose that the principals assigned to the senior, mezzanine, and equity tranches 
are 70%, 20%, and 10% instead of 75%, 20% and 5% for both ABSs and ABS CDOs 
in Figure 6.4. How are the results in Table 6.1 affected? 

Investigate what happens as the width of the mezzanine tranche of the ABS in 
Figure 6.4 is decreased, with the reduction in the mezzanine tranche principal 
being divided equally between the equity and senior tranches. In particular, what 
is the effect on Table 6.1? 


Chapter 7 


Valuation and Scenario 


Analysis: The Risk-Neutral 
and Real Worlds 


aluation and scenario analysis are two important activities for financial institu- 

tions. Both are concerned with estimating future cash flows, but they have dif- 

ferent objectives. In valuation, a financial institution is interested in estimating 
the present value of future cash flows. It does this by calculating the expected values (i.e., 
average values) of the future cash flows across all alternative outcomes and discounting 
the expected values back to today. In scenario analysis, a financial institution is inter- 
ested in exploring the full range of situations that might exist at a particular future time. 
Usually, it is the adverse outcomes that receive the most attention because risk managers 
working for the financial institution are interested in answering the question: “How bad 
can things get?” 

Suppose that a company sells one million one-year European call options on a stock. 
The stock price is $50 and the strike price is $55. The company might calculate the 
theoretical value of the options as +$4.5 million to the buyer and —$4.5 million to itself. 
If it sells the options for, say, $5 million, it can book $0.5 million of profit. But a scenario 
analysis might reveal that there is a 5% chance of the stock price rising to above $80 
in one year. This means that there is a 5% chance that the transaction will cost more 
than $20 million, after the initial amount received for the options has been taken into 
account. This example emphasizes the key difference between valuation and scenario 
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analysis. Valuation focuses on what will happen on average. (In our example, $4.5 million 
is the present value of the average payoff on the option.) Scenario analysis focuses on 
extreme outcomes. (In our example, $20 million is a possible net cost of the transaction 
to the company.) 

This chapter discusses the way valuation and scenario analysis should be carried out 
in practice. It distinguishes between real-world projections, which underlie scenario anal- 
ysis, and risk-neutral projections, which are used for valuation. It shows that risk-neutral 
valuation can be used for variables such as asset prices that evolve through time and to 
deal with situations where an outcome depends on whether a particular discrete event 
occurs (for example, a company defaulting). The chapter shows how Monte Carlo sim- 
ulations can be carried out. It explains the assumptions that are typically made for asset 
prices when they are projected. 


7.1 Volatility and Asset Prices 


As a preliminary to our discussion of valuation and scenario analysis, it is useful to pro- 
duce a few results concerned with the behavior of asset prices. Suppose an asset price 
is currently Sọ. A common assumption is that it has a constant expected growth rate of 
H per year (expressed with continuous compounding), and a constant volatility of © per 
year.! It can be shown that the probability density of the asset price, Sy, at time T years 
is then given by? 


In Sy ~ oin S, + (u — 6°/2)T, o° T] (7.1) 


where þ(m, v) denotes a normal distribution with mean m and variance v and In is the 
natural logarithm function. The variable Sy has what is termed a lognormal distribution 
because its natural logarithm is normally distributed. The mean of In Sy is In Sọ + (p — 
o°/2)T and the standard deviation of In S+ is oVT : 

The probability of S being less than some value V is the same as the probability of 
In Sy being less than In V. From the properties of the normal distribution this is 


z shira 
Prob(S; < V) = N In Sy ei) = N(-d,) (7.2) 


oVT 


where 
_ In(S,/V) + 4- 02/2)T 


d; = 
oVT 


1 Continuous compounding is explained in Appendix A. Volatility, as its name implies, is a measure 
of the uncertainty associated with the stock price movements. It will be defined more precisely in 
Chapter 10. 

?See, for example, J. Hull, Options, Futures, and Other Derivatives, 10th ed. (Upper Saddle River, NJ: 
Pearson, 2018). 
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and N is the cumulative normal distribution function (given by NORMSDIST in Excel). 
The probability that Sy is greater than V at time T is 


Prob(S. > V) = 1 — N(—d) = N(d») (7.3) 


Finally, suppose that we want to find the value, V, of Sy that has a probability q of being 
exceeded. This means that Prob(S; > V) = q. From equation (7.3), we require N(d,) = q 
so that 


_ (S/V) + (H 07 /2)T 


dg = 
oT 


= N7'(q) 


or 
V = S,exp ( — =) T- N“@ov7| (7.4) 


where N~! is the inverse of the cumulative normal distribution function (given by 
NORMSINV in Excel). Similarly, the value, V, of Sy such that Prob(S; < V) = q is 
given by setting N(—d,) = q or d, = —N7!(q). This leads to 


V = S,exp ( = =) ae N'oo VT] (7.5) 


7.2 Risk-Neutral Valuation 


The single most important result in the valuation of derivatives is risk-neutral valuation. 
A risk-neutral world can be defined as an imaginary world where investors require no 
compensation for bearing risks. In this world, the required expected return from a risky 
investment is the same as that for a safe investment. In both cases, the expected return 
is the risk-free rate. The world we live in is clearly not a risk-neutral world. Investors 
do require compensation for bearing risks. (A framework for understanding risk-return 
trade-offs is presented in Chapter 1.) The risk-neutral valuation result, however, states 
that we can value any derivative by assuming that the world is risk-neutral. We get the 
right answer not just for the risk-neutral world but for all other worlds as well. 

At first blush, the risk-neutral valuation result seems to make no sense. Investors 
do not live in a risk-neutral world. They do require higher expected returns when the 
risks they are bearing increase, and this applies to derivatives as well as to other invest- 
ments? But the key point to note when trying to understand risk-neutral valuation is 
that when we are valuing a derivative we are calculating its value in terms of the price of 


3 As explained in Chapter 1, it is systematic (i.e., non-diversifiable) risk that matters to investors. 
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an underlying asset. (The value of a stock option, for example, is calculated in terms of the 
price of the underlying stock.) The price of the underlying asset reflects the risk-return 
trade-offs of market participants. If market participants decide that they require a higher 
(or lower) return from an asset because of its risk, the price of the asset goes down (or 
up). What risk-neutral valuation says is that the formula for translating the price of the 
underlying asset into the value of the derivative is independent of the risk preferences of 
investors. 

In a risk-neutral world, all future expected cash flows are discounted at the risk- 
free interest rate. This simplifies valuation considerably. Suppose that we are valuing a 
call option on a stock and the risk-free interest rate is 3%. The steps in implementing 
risk-neutral valuation are: 


1. Assume that the expected (average) future return on the stock is 3%. 
2. Calculate the expected payoff from the call option. 
3. Discount the expected payoff at 3% to obtain the option’s value. 


A natural question is: “Why work in the risk-neutral world when it is more natural 
to work in the real world?” The answer is that we could in theory value the call option 
in the real world, but it would be much more difficult. The steps would be 


la. Estimate the expected (average) future return on the stock in the real world. 

2a. Calculate the expected payoff from the call option in the real world. 

3a. Discount the expected payoff at an appropriate discount rate to obtain the option’s 
value. 


For step 1a, we might be able to come up with a reasonable estimate of the 
(real-world) expected future return on the stock by estimating its beta and using the 
capital asset pricing model results in Chapter 1. Step 3a, however, would be really 
difficult. The correct discount rate to use for the expected payoff in the real world 
depends on the option’s beta (not the stock’s beta). It is likely to vary during the option’s 
life. As the stock price changes, the leverage implicit in the option changes and so the 
discount rate changes. If a single discount rate is used for the whole of the life of a call 
option, it is surprisingly high. Similarly, if a single discount rate is used for the whole of 
the life of a put option, it is surprisingly low, usually negative. How do we know this? 
We can use risk-neutral valuation to value the options and then work back from the 
answer to see what the correct discount rate would have to be for real-world payoffs to 
give that answer. We will illustrate this for the case of a binary option shortly. 

Risk-neutral valuation is an almost miraculous result. It means that we do not need 
to concern ourselves with issues such as the riskiness of a derivative and the return 
required by the market on either the underlying asset or the derivative. All we need to 
ask ourselves is: If we lived in a world where investors required an expected return equal 
to the risk-free rate on all investments, how would we value the derivative? Without 
risk-neutral valuation, the valuation of derivatives would be far more difficult (and far 
less precise) than it is. 
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It should be emphasized that risk-neutral valuation (or the assumption that investors 
do not care about risks when determining expected returns) is nothing more than an 
artificial device for valuing derivatives. The valuations that are obtained are correct in all 
worlds, not just the risk-neutral world. When we move from the risk-neutral world to 
the real world, two things happen. The expected payoffs from the derivative change, and 
the discount rate that must be used for the payoffs changes. It happens that these two 
changes always offset each other exactly. 


7.2.1 Application to Forward Contract 


As a simple example of risk-neutral valuation, consider the valuation of a long forward 
contract on a non-dividend-paying stock. Suppose that the delivery price is K and the 
time to maturity is T. The value of the contract at maturity is 


Sr = K 
where Sy is the stock price at time T. From the risk-neutral valuation argument, the 
value of the forward contract at time zero (today) is its expected value at time T in a 


risk-neutral world, discounted at the risk-free rate of interest. Denoting the value of the 
forward contract by f, this means that 


f =T Ê(Sr- K) 
where Ê denotes expected value in a risk-neutral world and r is the risk-free rate 


(expressed with continuous compounding and assumed constant). Because K is a con- 
stant, this becomes 


f = T £E(S,;) — Ke" (7.6) 
The expected growth rate of the stock price is r in a risk-neutral world. As a result, 
E(Sp) = Sge" 
where Sọ is the stock price today. Substituting this into equation (7.6) gives 
f = So — Ke7 (7.7) 
Similarly, the value of a short forward contract is 
Ke? — Sy (7.8) 


These results are consistent with Appendix C. 
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7.2.2 Application to Binary Options 


As a further example of the application of risk-neutral valuation, suppose that the price 
of a non-dividend-paying stock is $30 and there is a derivative that pays off $100 in one 
year if the stock price is greater than $40 at that time. (This is known as a binary or digital 
cash-or-nothing call option.) Suppose that the risk-free rate (continuously compounded) 
is 3% per annum, the expected growth rate of the stock price in the real world is 10% per 
annum (also continuously compounded), and the stock price volatility is 30% per annum. 

In a risk-neutral world, the expected growth of the stock price is 3% per annum. The 
risk-neutral probability of the stock price being greater than $40 in one year is obtained 
by setting p = 0.03, T = 1,6 = 0.3, Sọ = 30, and V = 40 in equation (7.3). It is 


In(30/40) + (0.03 — 0.37/2) x 1 


0.3x y1 


The expected payoff from the derivative in a risk-neutral world is therefore 100 x 
0.1565 = $15.65. The value of the derivative is calculated by discounting this for one 
year at the risk-free rate of 3%. It is 15.65e~"°9*! or $15.19. 

The real-world probability of the stock price being greater than $40 in one year 


= N(—1.0089) = 0.1565 


is calculated by setting u equal to the assumed real-world return on the stock, 10%, in 
equation (7.3). It is 0.2190. (As will be explained later in this chapter, we do not have 
to change the volatility when moving from the risk-neutral world to the real world or 
vice versa.) The expected payoff in the real world is therefore $21.90. As mentioned 
earlier, the problem with using the real-world expected payoff for valuation is that we 
do not know what discount rate to use. The stock price has risk associated with it that 
is priced by the market (otherwise the expected return on the stock would not be 7% 
more than the risk-free rate). The derivative has the effect of “leveraging this risk” so 
that a relatively high discount rate is required for its expected payoff. Because we know 
the correct value of the derivative is $15.19, we can deduce that the correct discount 
rate to apply to the $21.90 real-world expected payoff must be 36.6%. (This is because 
21.90e70366x1 = 15.19.) 


7.2.3 The Black-Scholes-Merton Application 


Consider next a European call option on a non-dividend-paying stock with strike price 
K and maturity T. Suppose that the risk-free rate is r. The payoff at time T is 


max(Sr — K, 0) 


where Sy is the stock price at time T. The expected payoff in a risk-neutral world is 
therefore 


A 


E[max(S, — K, 0)] 
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where as before E denotes expected value in a risk-neutral world. Using risk-neutral 
valuation, the value of the option is 


eT E[max(S; — K, 0)] (7.9) 
Similarly, the value of a put option is 
eT E[max(K — S+, 0)] 


After some algebraic manipulations, it can be shown that these equations lead to the 
Black—Scholes—Merton formulas for European stock options given in Appendix E at the 
end of the book f 


7.2.4 Discrete Outcomes 


Risk-neutral valuation can be used when outcomes are discrete. Suppose that one of two 
mutually exclusive outcomes will occur at time T. Define 7, as the value of a derivative 
that pays off $1 at time T if the first outcome occurs and nothing otherwise. Similarly, 
define mT, as the value of a derivative that pays off $1 at time T if the second outcome 
occurs and nothing otherwise. By buying both derivatives at a cost of 1, +25, we can be 
certain to receive $1 at time T. The value of $1 received with complete certainty at time 
T is PT where R is the (continuously compounded) risk-free interest rate for maturity 
T. (We do not have to assume constant interest rates.) It follows that 


nm, +m =e (7.10) 


Now consider a derivative that at time T provides a payoff of V; if the first outcome 
is realized and V, if the second outcome is realized. The value of the derivative is 


TV, +t N 


This is 
T Ty 
(1, + T2) —— V + — V, 


* As shown in J. C. Hull, Options, Futures, and Other Derivatives, 10th ed. (Upper Saddle River, NJ: Pear- 
son, 2018), there are three ways of obtaining the Black—Scholes—Merton formula. One is by deriving 
the differential equation that must be satisfied by all derivatives and solving it subject to appropriate 
boundary conditions. Another is to construct a binomial tree for the behavior of the stock price and 
take the limit as the length of the time step tends to zero. The third approach is to work from equation 
(7.9). The math for this last approach is in the appendix to Chapter 15 of Options, Futures, and Other 
Derivatives, 10th ed. 
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Substituting from equation (7.10), the value of the derivative is 
eRT (p V + pV) 


where 


Ti T2 
Pı = P2 = 


From this result it is natural to think of p, and p, as the risk-neutral probabilities 
of the two outcomes occurring. The value of a derivative is then the expected payoff 
in a risk-neutral world discounted at the risk-free rate. This illustrates that risk-neutral 
valuation applies to discrete outcomes. 

The result can be extended to the situation where there are many outcomes. Suppose 
that one of n mutually exclusive outcomes will occur at time T. Define 7; as the value of 
a derivative that pays off $1 if the ith outcome occurs and nothing otherwise (1 <i < n). 
The value of a derivative that pays V; for outcome i(1 < i < n) is 


where p,, the risk-neutral probability of the ith outcome, is given by 


T: 


1 


LTT 
: via 


7.2.5 Application to Default Probabilities 


Consider an instrument whose payoff depends on whether a particular company has 
defaulted. (This type of instrument is known as a credit derivative.) The type of analysis 
we have just presented can be used to show that the derivative should be valued by 


1. Estimating risk-neutral default probabilities 
2. Calculating the expected payoff from the instrument 
3. Discounting the expected payoff at the risk-free rate 


As will be explained in Chapter 19, the risk-neutral default probabilities can be 
implied from the yields on bonds issued by the company or credit default swap spreads. 
They are in general higher than real-world default probabilities. 


7.3 Scenario Analysis 


We now move on to consider scenario analysis. Here we are interested in examining 
what might happen in the future. The objective is not valuation, and future cash flows 
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are not discounted back to today. The world we consider when carrying out a scenario 
analysis should be the real world, not the risk-neutral world. The risk-neutral world, it 
should be remembered, is nothing more than an artificial device for valuing derivatives. 
Risk managers are not normally interested in future outcomes in a hypothetical world 
where everyone is risk neutral. 

Moving between the real world and the risk-neutral world is simplified by a result 
known as Girsanov’s theorem. This states that when we move from a world with one set of 
risk preferences to a world with another set of risk preferences, the expected growth rates 
of market variables such as stock prices, commodity prices, exchange rates, and interest 
rates change but their volatilities remain the same. 

To illustrate how scenario analysis is carried out, suppose that the expected return 
on a stock in the real world is 8%. The stock price is currently $30 per share and its 
volatility is 25%. Assume you own 10,000 shares. How much could you lose during the 
next year? 

From equation (7.5), the five percentile point of the stock price distribution in one 
year in the real world is: 


30 x exp | (0.08 — 0.257/2) x 1 + N7'(0.05) x 0.25 x vi = 20.88 


Similarly, the one percentile point of the stock price distribution is $17.61. 

These results show that you can be 95% certain that you will not lose more than 
$10,000 x ($30 —$20.88) or $91,200 during the next year. Similarly, you can be 99% 
certain you will not lose more than $123,900. As we will see in later chapters, these are 
what are known as value at risk estimates. 

The key point here is that results are based on the real-world return, not the riskless 
return. This is because we are conducting a scenario analysis, not a valuation. 


7.4 When Both Worlds Have to Be Used 


Sometimes a scenario analysis requires us to use both the real world and the risk-neutral 
world. The real world is used to generate scenarios out to the time horizon being consid- 
ered. The risk-neutral world is then used to value all outstanding derivative transactions 
at that time. To take a simple example, suppose a portfolio consists of a single two-year 
forward contract to sell one million shares of a stock. The stock price is $50 per share, the 
delivery price in the forward contract is $55, the risk-free rate is 3%, the expected return 
on the stock is 10%, and its volatility is 30%. We wish to carry out a scenario analysis to 
investigate what the value of the portfolio might be after six months. In order to do this, 
we must follow two steps: 


1. Calculate a probability distribution for the stock price at the end of six months in 
the real world. 
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2. Value the forward contract at the end of six months for the different stock prices that 
might arise to determine a probability distribution for the contract value at the end 
of six months. This involves a risk-neutral valuation calculation. (At the end of six 
months, the contract has 1.5 years remaining.) 


Suppose we are interested in a “worst case” outcome in six months where the loss 
has a probability of only 1% of being exceeded. In this case, because the portfolio is so 
simple, the worst outcome corresponds to the stock price that has a probability of only 
1% of being exceeded in the real world in six months. From equation (7.4) this stock 
price is 


50 exp [(0.1 — 0.37/2) x 0.5 — N7'(0.01) x 0.3 x V0.5] = 84.18 
For this stock price, equation (7.8) gives the value of the forward contract as 
55¢ 1X0 _ 84.18 = —31.61 


There is therefore a 1% chance that the portfolio will be worth less than —$31.61 million 
in six months. 

Note that the worst-case stock price was calculated in the real world. The forward 
contract was then valued for this worst-case outcome using risk-neutral valuation. 


7.5 The Calculations in Practice 


The example we have just considered is very simple because the portfolio consisted of 
a single instrument—a two-year short forward contract. We know that the value of this 
forward contract decreases as the price of the underlying stock increases. When the stock 
price has only a 1% chance of being exceeded, we know that the forward contract has a 
value that has only a 1% chance of being worsened. 

In practice, a financial institution usually has many instruments in its portfolio and 
the calculations necessary for a scenario analysis can be quite complicated. It is necessary 
to generate many scenarios for what might happen in the real world between today and 
the horizon date and then value the portfolio for each of these scenarios. The loss that 
has, say, a 1% probability of being exceeded can then be calculated. For example, if 1,000 
scenarios are considered, this loss is the 10th worst one. 

For stock prices, stock indices, and exchange rates, the most common model is one 
where the expected growth rate in the market variable, p, and its volatility, ©, are assumed 
to be constant, or perhaps functions of time. From equation (7.1), if S, is the value of the 
market variable at time f, 


In(S,,.a,) = In(S,) + (u—0?/2)At+ eo VAt 


Valuation and Scenario Analysis 155 


where € is a random sample from a normal distribution with mean zero and standard 
deviation one. This means that 


Sica; = S expia — 0° /2)At + eo At 


This equation allows the market variable to be simulated in steps of At by sampling from 
a standard normal distribution. 

In the case of short-term interest rates, volatilities, and commodity prices, a more 
complicated model where the variable exhibits volatility, but is pulled toward a long-run 
average level is usually assumed. This phenomenon is known as mean reversion. 

Different market variables are not usually assumed to move independently of each 
other. Correlations between market variables are usually estimated from historical data. 
These correlations are then reflected in the correlations between the € samples from 
standard normal distributions. (The way in which samples from a multivariate normal 
distribution are generated with particular correlations is described in Chapter 11.) 

As will be evident from this short description, scenario analyses can be very time 
consuming. In addition to sampling to determine the value of market variables on each 
trial, it is necessary to value the portfolio at the horizon date on each trial. Often, grid 
computing, where many computers are involved in completing a single activity, is used. 
Sometimes the number of Monte Carlo trials used must be restricted in order to produce 
results in a reasonable time. 


7.6 Estimating Real-World Processes 


The main problem in scenario analysis is that we usually have much more information 
about the behavior of market variables in the risk-neutral world than in the real world. We 
know that an investment asset such as a stock price must provide a return equal to the risk- 
free rate in the risk-neutral world, and we can estimate its volatility from either historical 
data or option prices. For a consumption asset (i.e., an asset that is not held purely for 
investment), futures prices can be used to provide information about how its price is 
expected to behave in a risk-neutral world. Similarly, the term structure of interest rates 
provides information about the process followed by interest rates in a risk-neutral world. 

Unfortunately, there is no simple way of determining the behavior of these variables 
in the real world. In theory, expected returns can be calculated from historical data. In 
practice, the amount of historical data required to get a reasonably accurate estimate is 
huge (much greater than that required to get a reasonable estimate of volatility). 

One approach to determining the real-world expected return on a stock is to use 
the capital asset pricing model (see Chapter 1). We first estimate p, the correlation of 
the return on the stock with the return on an index that is representative of the whole 
market, such as the S&P 500. As explained in Section 1.3, the stock’s beta, B, can be 
estimated as 
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where o is the volatility of the stock’s return and © y is the volatility of the S&P 500. The 
capital asset pricing model can be used to get the return in the real world as 


Rp + BE 


where Rp is the risk-free rate and E is the expected excess return of the market over the 
risk-free rate (often assumed to be 5% or 6%). 

A similar idea can be used for other variables. Suppose that the volatility of a market 
variable is © (the same in both the real world and the risk-neutral world). The excess of 
percentage changes in the variable in the real world over those in the risk-neutral world 
is Ao, where A is a parameter known as the variable’s market price of risk. In general, 


= 


Om 
where p is the correlation between percentage changes in the value of the variable and 
returns on the S&P 500. 

Consider a commodity price. If its return is uncorrelated with the return on the 
S&P 500, its expected return can be assumed to be the same in the real world and in the 
risk-neutral world. Alternatively, if p = 0.3,6,, = 0.2,and E = 0.06, we can deduce that 
à = 0.09. If the commodity’s price volatility is 40%, its return should be 0.09 x 0.40 or 
3.6% higher in the real world than in the risk-neutral world. 

In the case of interest rates, the market price of risk is negative (typically, in the —0.1 
to —0.2 range)’ This means that in the risk-neutral world interest rates grow faster than 
they do in the real world. (This makes interest rates different from stock prices, where 
the reverse is true.) 


Summary 


A confusing aspect of risk management is that valuation and scenario analysis are (or 
should be) based on different assumptions about how market variables such as stock 
prices, commodity prices, and exchange rates behave. To value a derivative in terms of 
the price of the underlying asset, the somewhat artificial assumption that the world is 
risk neutral is made. This means that the expected return from all assets that are held 
for investment purposes is assumed to be the risk-free rate and that expected payoffs are 
discounted at the risk-free rate. The ubiquitous risk-neutral valuation result states that 
the valuation we obtain when we do this is correct in the real world as well as in the 
risk-neutral world. 


>See J. Hull, A. Sokol, and A. White, “Short-Rate Joint-Measure Models,” Risk (October 2014): 59- 
63, for an approach to estimating the market price of interest rate risk. The Federal Reserve Board in 
the United States sometimes uses a rule of thumb that the drift of the real rate is one basis point per 
month below the real rate. 
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In scenario analysis, we are interested in how market variables behave in the real world 
(i.e., the world we actually live in). Fortunately, there is a result, Girsanov’s theorem, that 
tells us that the volatility of a variable is the same in the real and risk-neutral worlds. The 
expected return, however, is liable to be quite different in the two worlds. For example, 
the expected return from a stock or stock index is quite a bit higher in the real world 
than in the risk-neutral world. This is because, as discussed in Chapter 1, investors require 
compensation for bearing risks. 

A further confusing point is that sometimes it is necessary to consider both the real 
world and the risk-neutral world. Consider a financial institution that has a portfolio of 
derivatives and is interested in how much it could lose over the next year. The financial 
institution should consider how the relevant market variables will behave in the real 
world to generate many alternative scenarios for their values in one year. It should then 
use risk-neutral valuation to determine the value of the portfolio at the one-year point 
for each of the alternative scenarios. 
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Practice Questions and Problems 
(Answers at End of Book) 


7.1 A stock price has an expected return of 12% and a volatility of 20%. It is currently 
$50. What is the probability that it will be greater than $70 in two years? 

7.2 In Problem 7.1, what is the stock price that has a 5% probability of being exceeded 
in two years? 

7.3 Explain the principle of risk-neutral valuation. 

7.4 An analyst calculates the expected future value of a stock index in (a) the real world 
and (b) the risk-neutral world. Which would you expect to be higher? Why? 

7.5 The value of a derivative that pays off $100 after one year if a company has defaulted 
during the year is $3. The value of a derivative that pays off $100 after one year if a 
company has not defaulted is $95. What is the risk-free rate? What is the risk neutral 
probability of default? 

7.6 A binary option pays off $100 if a stock price is greater than $30 in three months. 
The current stock price is $25 and its volatility is 30%. The risk-free rate is 3% and 
the expected return on the stock is 10%. What is the value of the option? What is 
the real-world probability that the payoff will be received? 
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7.7 Explain why it is sometimes necessary to work in both the real world and the risk- 
neutral world when carrying out a scenario analysis to determine a confidence inter- 
val for the value of a portfolio in one year. 

7.8 Explain the meaning of mean reversion. 

7.9 Explain Girsanov’s theorem. 


Further Questions 


7.10 A stock price has an expected return of 9% and a volatility of 25%. It is currently 
$40. What is the probability that it will be less than $30 in 18 months? 

7.11 An investor owns 10,000 shares of a particular stock. The current market price is 
$80. What is the worst-case value of the portfolio in six months? For the purposes 
of this question, define the worst-case value of the portfolio as the value that is 
such that there is only a 1% chance of the actual value being lower. Assume that 
the expected return on the stock is 8% and its volatility is 20%. 

7.12 A binary option pays off $500 if a stock price is greater than $60 in three months. 
The current stock price is $61 and its volatility is 20%. The risk-free rate is 2% and 
the expected return on the stock is 8%. What is the value of the option? What is 
the real-world expected payoff? 


Part Two 


MARKET RISK 


Chapter 8 


How Traders Manage 
Their Risks 


he trading function within a financial institution is referred to as the front office. 
T The part of the financial institution that is concerned with the overall level of 

the risks being taken, capital adequacy, and regulatory compliance is referred to 
as the middle office. The record keeping function is referred to as the back office. As explained 
in Section 1.6, there are two levels within a financial institution at which trading risks 
are managed. First, the front office hedges risks by ensuring that exposures to individual 
market variables are not too great. Second, the middle office aggregates the exposures of 
all traders to determine whether the total risk is acceptable. In this chapter we focus on 
the hedging activities of the front office. In later chapters we will consider how risks are 
aggregated in the middle office. 

This chapter explains what are termed the “Greek letters” or simply the “Greeks.” 
Each of the Greeks measures a different aspect of the risk in a trading position. Traders 
calculate their Greeks at the end of each day and are required to take action if the internal 
risk limits of the financial institution they work for are exceeded. Failure to take this 
action is liable to lead to immediate dismissal. 


8.1 Delta 


Imagine that you are a trader working for a U.S. bank and are responsible for all trades 
involving gold. The current price of gold is $1,300 per ounce. Table 8.1 shows a summary 
of your portfolio (known as your “book”). How can you manage your risks? 
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Table 8.1 Summary of Gold Portfolio 


Position Value ($) 
Spot gold 3,180,000 
Forward contracts —3,060,000 
Futures contracts 2,000 
Swaps 180,000 
Options —6,110,000 
Exotics 125,000 
Total —5,683,000 


The value of your portfolio is currently —$5,683,000. (This could be partly because 
you have been a net seller of options and partly because the market has moved against 
you.) One way of investigating the risks you face is to revalue the portfolio on the 
assumption that there is a small increase in the price of gold from $1,300 per ounce to 
$1,300.10 per ounce. Suppose that this $0.10 increase in the price of gold decreases the 
value of your portfolio by $100 from —$5,683,000 to —$5,683,100. This means that the 
sensitivity of the portfolio to the price of gold is 


z100 = —1,000 
0.1 


This is referred to as the delta of the portfolio. The portfolio loses value at a rate of about 
$1,000 per $1 increase in the price of gold. Similarly, it gains value at a rate of about 
$1,000 per $1 decrease in the price of gold. 

In general, the delta of a portfolio with respect to a market variable is 


AP 
AS 
where AS is a small increase in the value of the variable and AP is the resulting change 


in the value of the portfolio. Using calculus terminology, delta is the partial derivative of 
the portfolio value with respect to the value of the variable: 


oP 
Delta = — 
elta JS 


In our example, the trader can eliminate the delta exposure by buying 1,000 ounces 
of gold. This is because the delta of a long position in 1,000 ounces of gold is 1,000. (The 
position gains value at the rate of $1,000 per $1 increase in the price of gold.) This is 
known as delta hedging. When the hedging trade is combined with the existing portfolio 
the resultant portfolio has a delta of zero. Such a portfolio is referred to as delta neutral. 


8.1.1 Linear Products 


A linear product is one whose value at any given time is linearly dependent on the value 
of an underlying market variable (see Figure 8.1). If the underlying variable is the price 
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Value of 
product 


Value of market variable 


Figure 8.1 A Linear Product 


of an asset such as gold, a spot position in the asset is clearly a linear product. The value of 
the position varies linearly with the price of the asset. A forward contract is also a linear 
product. However, as we shall see, an option is not. 

A linear product can be hedged relatively easily. As a simple example, consider a U.S. 
bank that has entered into a forward contract with a corporate client where it agreed to 
sell the client 1 million euros for $1.3 million in one year. Assume that the euro and dol- 
lar interest rates are 4% and 3% with annual compounding. This means that the present 
value of a 1 million euro cash flow in one year is 1,000,000/1.04 = 961,538 euros. The 
present value of 1.3 million dollars in one year is 1,300,000/1.03 = 1,262,136 dollars. 
Suppose that S is the value of one euro in dollars today. The value of the contract today in 


dollars is! 


1,262,136 — 961,538S 


This shows that the value of the contract is linearly related to the exchange rate, S. 
The delta of the contract is —961,538. It can be hedged by buying 961,538 euros. Because 
of the linearity, the hedge provides protection against both small and large movements 
in S. 

When the bank enters into the opposite transaction and agrees to buy one million 
euros in one year, the value of the contract is also linear in S 


961,538S — 1,262,136 


The bank has a delta of +961,538. It must hedge by shorting 961,538 euros. It does this 
by borrowing the euros today at 4% and immediately converting them to U.S. dollars. 
The one million euros received in one year are used to repay the loan. 


See Appendix C for more information on the valuation of forward contracts. 
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BUSINESS SNAPSHOT 8.1 
Hedging by Gold Mining Companies 


It is natural for a gold mining company to consider hedging against changes in 
the price of gold. Typically it takes several years to extract all the gold from a mine. 
Once a gold mining company decides to go ahead with production at a partic- 
ular mine, it has a big exposure to the price of gold. Indeed a mine that looks 
profitable at the outset could become unprofitable if the price of gold plunges. 
Gold mining companies are careful to explain their hedging strategies to 
potential shareholders. Some gold mining companies do not hedge. They tend 
to attract shareholders who buy gold stocks because they want to benefit when 
the price of gold increases and are prepared to accept the risk of a loss from a 
decrease in the price of gold. Other companies choose to hedge. They estimate 
the number of ounces they will produce each month for the next few years and 
enter into futures or forward contracts to lock in the price that will be received. 
Suppose you are Goldman Sachs and have just entered into a forward con- 
tract with a gold mining company where you agree to buy at a future time a 
large amount of gold at a fixed price. How do you hedge your risk? The answer 
is that you borrow gold from a central bank and sell it at the current market 
price. (The central banks of some of the countries that hold large amounts of 
gold are prepared to lend gold for a fee known as the gold lease rate.) At the end 
of the life of the forward contract, you buy gold from the gold mining company 
under the terms of the forward contract and use it to repay the central bank. 


Shorting assets to hedge forward contracts is sometimes tricky. Gold is an interesting 
case in point. Financial institutions often find that they enter into very large forward 
contracts to buy gold from gold producers. This means that they need to borrow large 
quantities of gold to create a short position for hedging. As outlined in Business Snap- 
shot 8.1, central banks are the source of the borrowed gold. A fee known as the gold 
lease rate is charged by central banks for lending the gold. 

Linear products have the attractive property that hedges protect against large changes 
as well as small ones in the value of the underlying asset. They also have another related 
attractive property: the hedge, once it has been set up, never needs to be changed. (This 
is sometimes referred to as the “hedge and forget” property.) For an illustration of this, 
consider again the first forward contract we considered where a bank agrees to sell a 
client 1 million euros for 1.3 million dollars. A total of 961,538 euros are purchased 
to hedge the position. These can be invested at 4% for one year so that they grow to 
exactly 1 million euros in one year. This is exactly what the bank needs to complete 
the forward transaction in one year so that there is no need to adjust the hedge during 
the year. 
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8.1.2 Nonlinear Products 


Options and other more complex derivatives dependent on the price of an underlying 
asset are nonlinear products. The relationship between the value of the product and the 
underlying asset price at any given time is nonlinear. This nonlinearity makes them more 
difficult to hedge for two reasons. First, making a nonlinear portfolio delta neutral only 
protects against small movements in the price of the underlying asset. Second, we are 
not in a hedge-and-forget situation. The hedge needs to be changed frequently. This is 
known as dynamic hedging. 

Consider as an example a trader who sells 100,000 European call options on a non- 
dividend-paying stock when 


1. Stock price is $49 

2. Strike price is $50 

3. Risk-free interest rate is 5% 

4. Stock price volatility is 20% per annum 
5. Time to option maturity is 20 weeks 


We suppose that the amount received for the options is $300,000 and that the trader has 
no other positions dependent on the stock. 

The value of one option as a function of the underlying stock price is shown in 
Figure 8.2. Because the function is nonlinear (i.e., curved), the delta of a position in one 
option depends on the stock price. When the stock price is low, the value of the option 
is almost zero and the dollar increase in the value of the option as a result of a small 
change in the stock price is also close to zero. This means that the delta of the option 
is close to zero. When the stock price is high, say $70, the option is almost certain to 
be exercised and the delta of a position in one option is close to 1.0. This is because an 
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Figure 8.2 Value of Call Option as a Function of Stock Price (strike price = $50, risk-free rate = 
5%, volatility = 20%, time to maturity = 20 weeks) 
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Figure 8.3 Delta of Call Option as a Function of Stock Price (strike price = $50, risk-free rate = 
5%, volatility = 20%, time to maturity = 20 weeks) 


increase (decrease) of AS in the stock price increases (decreases) the value of the option 
by almost AS. 

In our example, the stock price is $49. The delta of the option is the gradient of the 
curve at this point. This is 0.522, as indicated in Figure 8.2. (See Appendix E for the 
calculation of the Greeks for European options.) The delta of one option changes with 
the stock price in the way shown in Figure 8.3 At the time of the trade, the value of an 
option to buy one share of the stock is $2.40 and the delta of the option is 0.522. Because 
the trader is short 100,000 options, the value of the trader’s portfolio is —$240,000 and 
the delta of the portfolio is —52,200. The trader can feel pleased that the options have 
been sold for $60,000 more than their theoretical value, but is faced with the problem 
of hedging the risk in the portfolio. 

Immediately after the trade, the trader’s portfolio can be made delta neutral by buying 
52,200 shares of the underlying stock. If there is a small decrease (increase) in the stock 
price, the gain (loss) to the trader of the short option position should be offset by the 
loss (gain) on the shares. For example, if the stock price increases from $49 to $49.10, 
the value of the option position will decrease by about 52,200 X 0.10 = $5,220, while 
the value of the shares will increase by this amount. 

As mentioned, in the case of linear products, once the hedge has been set up it does 
not need to be changed. This is not the case for nonlinear products. To preserve delta 
neutrality, the hedge has to be adjusted periodically. This is known as rebalancing. 

Tables 8.2 and 8.3 provide two examples of how rebalancing might work in our 
example. Rebalancing is assumed to be done weekly. As mentioned, the initial value of 


? Figures 8.2 and 8.3 were produced with the RMFI software that can be downloaded from the author's 
website. The Black—Scholes—Merton model is selected by choosing “Black—Scholes-European” as the 
option type. 
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Table 8.2 Simulation of Delta Hedging (option closes in-the-money and cost of hedging is 
$263,300) 


Cost of 
Shares Cumulative 
Stock Shares Purchased Cash Outflow Interest Cost 
Week Price Delta Purchased ($000) ($000) ($000) 
0 49.00 0.522 52,200 2,557.8 2,557.8 2.5 
1 48.12 0.458 (6,400) (308.0) 2,252.3 2,2 
2 47.37 0.400 (5,800) (274.7) 1,979.8 1.9 
3 50.25 0.596 19,600 984.9 2,966.6 2,9 
4 51.75 0.693 9,700 502.0 3,471.5 3.3 
5 53.12 0.774 8,100 430.3 3,905.1 3.8 
6 53.00 0.771 (300) (15.9) 3,893.0 3.7 
7 51.87 0.706 (6,500) (337.2) 3,559.5 3.4 
8 51.38 0.674 (3,200) (164.4) 3,398.5 3.3 
9 53.00 0.787 11,300 598.9 4,000.7 3.8 
10 49.88 0.550 (23,700) (1,182.2) 2,822.3 2.7 
11 48.50 0.413 (13,700) (664.4) 2,160.6 2.1 
12 49.88 0.542 12,900 643.5 2,806.2 2.7 
13 50.37 0.591 4,900 246.8 3,055.7 2.9 
14 52.13 0.768 17,700 922.7 3,981.3 3.8 
15 51.88 0.759 (900) (46.7) 3,938.4 3.8 
16 52.87 0.865 10,600 560.4 4,502.6 4.3 
17 54.87 0.978 11,300 620.0 5,126.9 4.9 
18 54.62 0.990 1,200 65.5 5,197.3 5.0 
19 55.87 1.000 1,000 55.9 5,258.2 5.1 
20 57.25 1.000 0) 0.0 5,263.3 


delta for a single option is 0.522 and the delta of the portfolio is —52,200. This means 
that, as soon as the option is written, $2,557,800 must be borrowed to buy 52,200 shares 
at a price of $49. The rate of interest is 5%. An interest cost of approximately $2,500 is 
therefore incurred in the first week. 

In Table 8.2, the stock price falls by the end of the first week to $48.12. The delta 
declines to 0.458. A long position in 45,800 shares is now required to hedge the option 
position. A total of 6,400 (= 52,200 — 45,800) shares are therefore sold to maintain the 
delta neutrality of the hedge. The strategy realizes $308,000 in cash, and the cumulative 
borrowings at the end of week 1 are reduced to $2,252,300. During the second week the 
stock price reduces to $47.37 and delta declines again. This leads to 5,800 shares being 
sold at the end of the second week. During the third week, the stock price increases 
to over $50 and delta increases. This leads to 19,600 shares being purchased at the end 
of the third week. Toward the end of the life of the option, it becomes apparent that 
the option will be exercised and delta approaches 1.0. By week 20, therefore, the hedger 
owns 100,000 shares. The hedger receives $5 million (= 100,000 x $50) for these shares 
when the option is exercised so that the total cost of writing the option and hedging it 
is $263,300. 
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Table 8.3 Simulation of Delta Hedging (option closes out-of-the-money and cost of hedging = 
$256,600) 


Cost of 
Shares Cumulative 
Stock Shares Purchased Cash Outflow Interest Cost 
Week Price Delta Purchased ($000) ($000) ($000) 
0 49.00 0.522 52,200 2,557.8 2,557.8 2.5 
1 49.75 0.568 4,600 228.9 2,789.2 27 
2 52.00 0.705 13,700 712.4 3,504.3 3.4 
3 50.00 0.579 (12,600) (630.0) 2,877.7 2.8 
4 48.38 0.459 (12,000) (580.6) 2,299.9 2.2 
5 48.25 0.443 (1,600) (77.2) 2,224.9 2:1 
6 48.75 0.475 3,200 156.0 2,383.0 2.3 
7 49.63 0.540 6,500 322.6 2,707.9 2.6 
8 48.25 0.420 (12,000) (579.0) 2,131.5 2.1 
9 48.25 0.410 (1,000) (48.2) 2,085.4 2.0 
10 51.12 0.658 24,800 1,267.8 3,355.2 3.2 
11 51.50 0.692 3,400 175.1 3,533.5 3.4 
12 49.88 0.542 (15,000) 748.2) 2,788.7 2.7 
13 49.88 0.538 (400) (20.0) 2,771.4 2.7 
14 48.75 0.400 (13,800) 672.7) 2,101.4 2.0 
15 47.50 0.236 (16,400) 779.0) 1,324.4 1.3 
16 48.00 0.261 2,500 120.0 1,445.7 1.4 
17 46.25 0.062 (19,900) 920.4) 526.7 0.5 
18 48.13 0.183 12,100 582.4 1,109.6 1.1 
19 46.63 0.007 (17,600) 820.7) 290.0 0.3 
20 48.12 0.000 (700) (33.7) 256.6 


Table 8.3 illustrates an alternative sequence of events where the option closes out- 
of-the-money. As it becomes clear that the option will not be exercised, delta approaches 
zero. By week 20, the hedger therefore has no position in the underlying stock. The total 
costs incurred are $256,600. 

In Tables 8.2 and 8.3, the costs of hedging the option, when discounted to the 
beginning of the period, are close to, but not exactly, the same as the theoretical (Black— 
Scholes—Merton) price of $240,000. If the hedging scheme worked perfectly, the cost of 
hedging would, after discounting, be exactly equal to the Black—Scholes—Merton price 
for every simulated stock price path. The reason for the variation in the cost of delta 
hedging is that the hedge is rebalanced only once a week. As rebalancing takes place 
more frequently, the variation in the cost of hedging is reduced. Of course, the examples 
in Tables 8.2 and 8.3 are idealized in that they assume the model underlying the Black— 
Scholes—Merton formula is exactly correct and there are no transactions costs. 

Delta hedging aims to keep the value of the financial institution’s position as close to 
unchanged as possible. Initially, the value of the written option is $240,000. In the situa- 
tion depicted in Table 8.2, the value of the option can be calculated as $414,500 in week 9. 
Thus, the financial institution has lost 414,500 — 240,000 = $174,500 on its short option 
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position. Its cash position, as measured by the cumulative cost, is $1,442,900 worse in 
week 9 than in week 0. The value of the shares held has increased from $2,557,800 to 
$4,171,100 for a gain of $1,613,300. The net effect is that the value of the financial 
institution’s position has changed by only $4,100 during the nine-week period. 


8.1.3 Where the Cost Comes From 


The delta-hedging procedure in Tables 8.2 and 8.3 in effect creates a long position in 
the option synthetically to neutralize the trader’s short option position. As the tables 
illustrate, the procedure tends to involve selling stock just after the price has gone down 
and buying stock just after the price has gone up. It might be termed a buy-high, sell-low 
trading strategy! The cost of $240,000 comes from the average difference between the 
price paid for the stock and the price realized for it. 


8.1.4 Economies of Scale 


Maintaining a delta-neutral position in a single option and the underlying asset, in the 
way that has just been described, is liable to be prohibitively expensive because of the 
transactions costs incurred on trades. Maintaining delta neutrality is more feasible for 
a large portfolio of derivatives dependent on a single asset because only one trade in 
the underlying asset is necessary to zero out delta for the whole portfolio. The hedging 
transactions costs are absorbed by the profits on many different trades. This shows that 
there are economies of scale in trading derivatives. It is not surprising that the derivatives 
market is dominated by a small number of large dealers. 


8.2 Gamma 


As mentioned, for a nonlinear portfolio, delta neutrality only provides protection against 
small changes in the price of the underlying asset. 

The gamma, I’, of a portfolio measures the extent to which large changes cause prob- 
lems. Gamma is the rate of change of the portfolio’s delta with respect to the price of the 
underlying asset. It is the second partial derivative of the portfolio with respect to asset 
price: 


If gamma is small, delta changes slowly, and adjustments to keep a portfolio delta neutral 
need to be made only relatively infrequently. However, if gamma is large in absolute 
terms, delta is highly sensitive to the price of the underlying asset. It is then quite risky to 
leave a delta-neutral portfolio unchanged for any length of time. Figure 8.4 illustrates this 
point for an option on a stock. When the stock price moves from S to S’, delta hedging 
assumes that the option price moves from C to C’, when in fact it moves from C to 
C". The difference between C’ and C” leads to a hedging error. This error depends on 
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Figure 8.4 Hedging Error Introduced by Nonlinearity 
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Figure 8.5 Relationship between Gamma of an Option and Price of Underlying Stock Where K 
Is the Option’s Strike Price 


the curvature of the relationship between the option price and the stock price. Gamma 
measures this curvature? 

Gamma is positive for a long position in an option. The general way in which gamma 
varies with the price of the underlying stock is shown in Figure 8.5. Gamma is greatest 
for options where the stock price is close to the strike price K. 


8.2.1 Making a Portfolio Gamma Neutral 


A linear product has zero gamma and cannot be used to change the gamma ofa portfolio. 
What is required is a position in an instrument, such as an option, that is not linearly 
dependent on the underlying asset price. 

Suppose that a delta-neutral portfolio has a gamma equal to I’, and a traded option 
has a gamma equal to I. If the number of traded options added to the portfolio is wy, 
the gamma of the portfolio is 


3Indeed, the gamma of an option is sometimes referred to as its curvature. 
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Hence, the position in the traded option necessary to make the portfolio gamma neutral 
is wy = —I /T z. Including the traded option is likely to change the delta of the portfolio, 
so the position in the underlying asset then has to be changed to maintain delta neutrality. 
Note that the portfolio is gamma neutral only for a short period of time. As time passes, 
gamma neutrality can be maintained only if the position in the traded option is adjusted 
so that it is always equal to -I/I +. 

Making a delta-neutral portfolio gamma neutral can be regarded as a first correction 
for the fact that the position in the underlying asset cannot be changed continuously 
when delta hedging is used. Delta neutrality provides protection against relatively small 
asset price moves between rebalancing. Gamma neutrality provides protection against 
larger movements in the asset price between hedge rebalancing. Suppose that a portfolio 
is delta neutral and has a gamma of —3,000. The delta and gamma of a particular traded 
call option are 0.62 and 1.50, respectively. The portfolio can be made gamma neutral by 
including in the portfolio a long position of 


3,000 


= 2,000 
1.5 


in the call option. (The gamma of the portfolio is then —3,000 + 1.5 X 2,000 = 0.) How- 
ever, the delta of the portfolio will then change from zero to 2,000 x 0.62 = 1,240. A 
quantity, 1,240, of the underlying asset must therefore be sold to keep it delta neutral. 


8.3 Vega 


Another source of risk in derivatives trading is the possibility that volatility will change. 
The volatility of a market variable measures our uncertainty about the future value of 
the variable. (It will be discussed more fully in Chapter 10.) In option valuation models, 
volatilities are often assumed to be constant, but in practice they do change through time. 
Spot positions and forwards do not depend on the volatility of underlying asset prices, but 
options and more complicated derivatives do. Their values are liable to change because 
of movements in volatility as well as because of changes in the asset price and the passage 
of time. 

The vega of a portfolio, V, is the rate of change of the value of the portfolio with 


respect to the volatility, ©, of the underlying asset price.* 


_ oP 
do 
If vega is high in absolute terms, the portfolio’s value is very sensitive to small changes in 


volatility. If vega is low in absolute terms, volatility changes have relatively little impact 
on the value of the portfolio. 


“Vega is the name given to one of the “Greek letters” in option pricing, but it is not one of the letters 
in the Greek alphabet. 
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The vega of a portfolio can be changed by adding a position in a traded option. If V 
is the vega of the portfolio and Vy is the vega of a traded option, a position of — V / Vy 
in the traded option makes the portfolio instantaneously vega neutral. Unfortunately, a 
portfolio that is gamma neutral will not, in general, be vega neutral, and vice versa. If 
a hedger requires a portfolio to be both gamma and vega neutral, at least two traded 
derivatives dependent on the underlying asset must usually be used. 


Example 8.1 


Consider a portfolio dependent on the price of a single asset that is delta neutral, with a 
gamma of —5,000 and a vega of —8,000. The options shown in the table below can be 
traded. The portfolio could be made vega neutral by including a long position in 4,000 
of Option 1. This would increase delta to 2,400 and require that 2,400 units of the asset 
be sold to maintain delta neutrality. The gamma of the portfolio would change from 
—5,000 to —3,000. 


Delta Gamma Vega 
Portfolio 0 —5,000 —8,000 
Option 1 0.6 0.5 2.0 
Option 2 0.5 0.8 1.2 


To make the portfolio gamma and vega neutral, both Option 1 and Option 2 can 
be used. If w and w, are the quantities of Option 1 and Option 2 that are added to the 
portfolio, we require that 


—5,000 + 0.5; + 0.8w = 0 
—8,000 + 2.0; + 1.2w = 0 


The solution to these equations is w4 = 400, w, = 6,000. The portfolio can therefore be 
made gamma and vega neutral by including 400 of Option 1 and 6,000 of Option 2. 
The delta of the portfolio after the addition of the positions in the two traded options is 
400 x 0.6 + 6,000 x 0.5 = 3,240. Hence, 3,240 units of the underlying asset would have 
to be sold to maintain delta neutrality. 


The vega of a long position in an option is positive. The variation of vega with the 
price of the underlying asset is similar to that of gamma and is shown in Figure 8.6. 
Gamma neutrality protects against large changes in the price of the underlying asset 
between hedge rebalancing. Vega neutrality protects against variations in volatility. 

The volatilities of short-dated options tend to be more variable than the volatilities 
of long-dated options. The vega of a portfolio is therefore often calculated by changing 
the volatilities of short-dated options by more than that of long-dated options. One way 
of doing this is discussed in Section 10.10. 
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Figure 8.6 Variation of Vega of an Option with Price of Underlying Asset Where K Is Option’s 
Strike Price 
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Figure 8.7 Variation of Theta of a European Call Option with Stock Price Where K Is Option’s 
Strike Price 


8.4 Theta 


The theta of a portfolio, O, is the rate of change of the value of the portfolio with respect 
to the passage of time, with all else remaining the same. Theta is sometimes referred to 
as the time decay of the portfolio. 

Theta is usually negative for an option” This is because as the time to maturity 
decreases with all else remaining the same, the option tends to become less valuable. The 
general way in which © varies with stock price for a call option on a stock is shown in 
Figure 8.7. When the stock price is very low, theta is close to zero. For an at-the-money 
call option, theta is large and negative. Figure 8.8 shows typical patterns for the variation 
of © with the time to maturity for in-the-money, at-the-money, and out-of-the-money 
call options. 

Theta is not the same type of Greek letter as delta. There is uncertainty about the 
future price of the underlying asset, but there is no uncertainty about the passage of time. 


5 An exception to this could be an in-the-money European put option on a non-dividend-paying 
stock or an in-the-money European call option on a currency with a very high interest rate. 
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Figure 8.8 Typical Patterns for Variation of Theta of a European Call Option with Time to 
Maturity 


It makes sense to hedge against changes in the price of an underlying asset, but it does not 
make any sense to hedge against the effect of the passage of time on an option portfolio. 
In spite of this, many traders regard theta as a useful descriptive statistic for a portfolio. In 
a delta-neutral portfolio, when theta is large and positive, gamma tends to be large and 
negative, and vice versa. 


8.5 Rho 


Yet another Greek letter is rho. Rho is the rate of change of a portfolio with respect to 
the level of interest rates. Currency options have two rhos, one for the domestic interest 
rate and one for the foreign interest rate. When bonds and interest rate derivatives are 
part of the portfolio, traders usually consider carefully the ways in which the whole term 
structure of interest rates can change. We discuss this in the next chapter. 


8.6 Calculating Greek Letters 


Appendices E and F explain how Greek letters can be calculated. The RMFI software, 
which can be downloaded from the author’s website, can be used for European and 
American options and some exotic options. Consider again the European call option in 
Section 8.1. The stock price is $49, the strike price is $50, the risk-free rate is 5%, the 
stock price volatility is 20%, and the time to exercise 1s 20 weeks or 20/52 year. Table 8.4 
shows delta, gamma, vega, theta, and rho for the option (i.e., for a long position in one 
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Table 8.4 Greek Letters Calculated Using RMFI Software 


Single Short Position 

Option in 100,000 Options 
Value ($) 2.40 —240,000 
Delta 0.522 —52,200 
Gamma 0.066 —6,600 
Vega (per %) 0.121 —12,100 
Theta (per day) —0.012 1,200 
Rho (per %) 0.089 —8,900 


option) and for a short position in 100,000 options, which was the position considered 
in Tables 8.2 and 8.3. 
Here are some examples of how these numbers can be interpreted: 


1. When there is an increase of $0.1 in the stock price with no other changes, the 
option price increases by about 0.522 x 0.1 or $0.0522. The value of a short position 
in 100,000 options decreases by $5,220. 

2. When there is an increase of $0.1 in the stock price with no other changes, the delta 
of the option increases by about 0.066 x 0.1 or 0.0066. The delta of a short position 
in 100,000 options decreases by 660. 

3. When there is an increase in volatility of 0.5% from 20% to 20.5% with no other 
changes, the option price increases by about 0.121 X 0.5 or $0.0605. The value of a 
short position in 100,000 options decreases by $6,050. 

4. When one day goes by with no changes to the stock price or its volatility, the option 
price decreases by about $0.012. The value of a short position in 100,000 options 
increases by $1,200. 

5. When interest rates increase by 1% (or 100 basis points) with no other changes, the 
option price increases by $0.089. The value of a short position in 100,000 options 
decreases by $8,900. 


8.7 Taylor Series Expansions 


Taylor series expansions are explained in Appendix G. They can be used to show how 
the change in the portfolio value in a short period of time depends on the Greek letters. 
Consider a portfolio dependent on a single asset price, S. If the volatility of the underlying 
asset and interest rates are assumed to be constant, the value of the portfolio, P, is a 
function of S, and time t. The Taylor series expansion gives 


oP ðP 1P, > 1P, @P 
AP = —AS + —At+ AEP + HMAF + HASA H 8.1 
os ot 2 0S? 2 or OSot (i 
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where AP and AS are the change in P and S in a small time interval At. The first term 
on the right-hand side is delta times AS and is eliminated by delta hedging. The second 
term, which is theta times Af, is non-stochastic. The third term can be made zero by 
ensuring that the portfolio is gamma neutral as well as delta neutral. Arguments from 
stochastic calculus show that AS is of order VAt. This means that the third term on the 
right-hand side is of order At. Later terms in the Taylor series expansion are of higher 
order than At. 

For a delta-neutral portfolio, the first term on the right-hand side of equation (8.1) 
is zero, so that 


AP = @At+ STAs? (8.2) 


when terms of higher order than At are ignored. The relationship between the change in 
the portfolio value and the change in the stock price is quadratic, as shown in Figure 8.9. 
When gamma is positive, the holder of the portfolio gains from large movements in the 
asset price and loses when there is little or no movement. When gamma is negative, the 


AP AP 
A A 


(a) (b) 
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(c) (d) 
Figure 8.9 Alternative Relationships between AP and AS for a Delta-Neutral Portfolio 
(a) Slightly positive gamma, (b) large positive gamma, (c) slightly negative gamma, and (d) large 
negative gamma 
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reverse is true so that a large positive or negative movement in the asset price leads to 
severe losses. 


Example 8.2 


Suppose that the gamma of a delta-neutral portfolio of options on an asset is —10,000. 
Suppose that a change of +2 in the price of the asset occurs over a short period of 
time (for which At can be assumed to be zero). Equation (8.2) shows that there is an 
unexpected decrease in the value of the portfolio of approximately 0.5 x 10,000 x 2? = 
$20,000. Note that the same unexpected decrease would occur if there were a change 
of —2. 


When the volatility, o, of the underlying asset is uncertain, P is a function of o, S, 
and t. Equation (8.1) then becomes 


ðP ðP ðP 1P, 1#P, o 
AP = —AS+ —Ao + —At+ ———AS? + HMMA + + 
os Oo Ot 2 dS? 2 do? 


where Ao is the change in o in time At. In this case, delta hedging eliminates the first 
term on the right-hand side. The second term is eliminated by making the portfolio vega 
neutral. The third term is non-stochastic. The fourth term is eliminated by making the 
portfolio gamma neutral. 

Traders often define other “Greek letters” to correspond to higher-order terms in 
the Taylor series expansion. For example, 0? P/0Sdo is known as vanna, 0° P/ 067 is known 
as vomma, and 0*P/0Sdt is known as charm. 


8.8 The Realities of Hedging 


In an ideal world, traders working for financial institutions would be able to rebalance 
their portfolios very frequently in order to maintain a zero delta, a zero gamma, a zero 
vega, and so on. In practice, this is not possible. When managing a large portfolio depen- 
dent on a single underlying asset, traders usually make delta zero, or close to zero at least 
once a day by trading the underlying asset. Unfortunately a zero gamma and a zero vega 
are less easy to achieve because it is difficult to find options or other nonlinear deriva- 
tives that can be traded in the volume required at competitive prices (see discussion of 
dynamic hedging in Business Snapshot 8.2). 

As noted earlier, there are big economies of scale in trading derivatives. Maintaining 
delta neutrality for an individual option on an asset by trading the asset daily would be 
prohibitively expensive. But it is realistic to do this for a portfolio of several hundred 
options on the asset. This is because the cost of daily rebalancing is covered by the profit 
on many different trades. 
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BUSINESS SNAPSHOT 8.2 
Dynamic Hedging in Practice 


Ina typical arrangement at a financial institution, the responsibility for a portfolio 
of derivatives dependent on a particular underlying asset is assigned to one trader 
or to a group of traders working together. For example, one trader at Goldman 
Sachs might be assigned responsibility for all derivatives dependent on the value 
of the Australian dollar. A computer system calculates the value of the portfolio 
and Greek letters for the portfolio. Limits are defined for each Greek letter and 
special permission is required if a trader wants to exceed a limit at the end of a 
trading day. 

The delta limit is often expressed as the equivalent maximum position in the 
underlying asset. For example, the delta limit of Goldman Sachs on a stock might 
be specified as $10 million. If the stock price is $50, this means that the absolute 
value of delta as we have calculated it can be no more that 200,000. The vega limit 
is usually expressed as a maximum dollar exposure per 1% change in the volatility. 

As a matter of course, options traders make themselves delta neutral—or 
close to delta neutral—at the end of each day. Gamma and vega are monitored, 
but are not usually managed on a daily basis. Financial institutions often find 
that their business with clients involves writing options and that as a result they 
accumulate negative gamma and vega. They are then always looking out for 
Opportunities to manage their gamma and vega risks by buying options at com- 
petitive prices. 

There is one aspect of an options portfolio that mitigates problems of man- 
aging gamma and vega somewhat. Options are often close to the money when 
they are first sold so that they have relatively high gammas and vegas. But after 
some time has elapsed, the underlying asset price has often changed enough for 
them to become deep-out-of-the-money or deep-in-the-money. Their gammas 
and vegas are then very small and of little consequence. The nightmare scenario 
for an options trader is where written options remain very close to the money 
as the maturity date is approached. 


8.9 Hedging Exotic Options 


Exotic options (see Section 5.7) can often be hedged using the approach we have out- 
lined. As explained in Business Snapshot 8.3, delta hedging is sometimes easier for exotics 
and sometimes more difficult. When delta hedging is not feasible for a portfolio of exotic 
options an alternative approach known as static options replication is sometimes used. This 
is illustrated in Figure 8.10. Suppose that S denotes the asset price and f denotes time 
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BUSINESS SNAPSHOT 8.3 
Is Delta Hedging Easier or More Difficult for Exotics? 


We can approach the hedging of exotic options by creating a delta-neutral posi- 
tion and rebalancing frequently to maintain delta neutrality. When we do this, 
we find that some exotic options are easier to hedge than plain vanilla options 
and some are more difficult. 

An example of an exotic option that is relatively easy to hedge is an average 
price call option (see Asian options in Section 5.7). As time passes, we observe 
more of the asset prices that will be used in calculating the final average. This 
means that our uncertainty about the payoff decreases with the passage of time. 
As a result, the option becomes progressively easier to hedge. In the final few 
days, the delta of the option always approaches zero because price movements 
during this time have very little impact on the payoff. 

By contrast, barrier options (see Section 5.7) are relatively difficult to hedge. 
Consider a knock-out call option on a currency when the exchange rate is 
0.0005 above the barrier. If the barrier is hit, the option is worth nothing. If 
the barrier is not hit, the option may prove to be quite valuable. The delta of 
the option is discontinuous at the barrier, making conventional hedging very 


difficult. 


Value of exotic option 
portfolio and portfolio P 
is the same at these points. 


< 


S 


Figure 8.10 Static Options Replication 
In a replicating portfolio, P is chosen so that it has the same value as the exotic option 
portfolio at a number of points on a boundary. 
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with the current (t = 0) value of S being Sọ. Static options replication involves choosing 
a boundary in {S, t} space that will eventually be reached and then finding a portfolio 
of regular options that is worth the same as the portfolio of exotic options at a number 
of points on the boundary. The portfolio of exotic options is hedged by shorting this 
portfolio of regular options. Once the boundary is reached, the hedge is unwound. A 
new hedge can then be created with static options replication if desired. 

The theory underlying static options replication is that, if two portfolios are worth 
the same at all {S, t} points on the boundary, they must be worth the same at all the 
{ S, t} points that can be reached prior to the boundary. In practice, values of the original 
portfolio of exotic options and the replicating portfolio of regular options are matched 
at some, but not all, points on the boundary. The procedure therefore relies on the idea 
that, if two portfolios have the same value at a reasonably large number of points on the 
boundary, their values are likely to be close at other points on the boundary. 


8.10 Scenario Analysis 


In addition to monitoring risks such as delta, gamma, and vega, option traders often also 
carry out a scenario analysis. The analysis involves calculating the gain or loss on their 
portfolio over a specified period under a variety of different scenarios. The time period 
chosen is likely to depend on the liquidity of the instruments. The scenarios can be either 
chosen by management or generated by a model. 

Consider a trader with a portfolio of options on a particular foreign currency. There 
are two main variables on which the value of the portfolio depends. These are the 
exchange rate and the exchange rate volatility. Suppose that the exchange rate is cur- 
rently 1.0000 and its volatility is 10% per annum. The bank could calculate a table such 
as Table 8.5 showing the profit or loss experienced during a two-week period under 
different scenarios. This table considers seven different exchange rates and three different 
volatilities. 

In Table 8.5, the greatest loss is in the lower-right corner of the table. The loss 
corresponds to the volatility increasing to 12% and the exchange rate moving up to 1.06. 
Usually the greatest loss in a table such as 8.5 occurs at one of the corners, but this is not 
always so. For example, as we saw in Figure 8.9, when gamma is positive, the greatest loss 
is experienced when the underlying asset price stays where it is. 


Table 8.5 Profit or Loss Realized in Two Weeks under Different Scenarios ($ millions) 
Exchange Rate 


Volatility 0.94 0.96 0.98 1.00 1.02 1.04 1.06 
8% +102 +55 +25 +6 —10 —34 —80 
10% +80 +40 +17 +2 —14 —38 —85 


12% +60 +25 +9 —2 —18 —42 —90 
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Summary 


A trader working for a bank, who is responsible for all the trades involving a particular 
asset, monitors a number of Greek letters and ensures that they are kept within the limits 
specified by the bank. 

The delta, A, of a portfolio is the rate of change of its value with respect to the 
price of the underlying asset. Delta hedging involves creating a position with zero delta 
(sometimes referred to as a delta-neutral position). Because the delta of the underlying 
asset is 1.0, one way of hedging the portfolio is to take a position of —A in the underlying 
asset. For portfolios involving options and more complex derivatives, the position taken 
in the underlying asset has to be changed periodically. This is known as rebalancing. 

Once a portfolio has been made delta neutral, the next stage is often to look at its 
gamma. The gamma of a portfolio is the rate of change of its delta with respect to the 
price of the underlying asset. It is a measure of the curvature of the relationship between 
the portfolio and the asset price. Another important hedge statistic is vega. This measures 
the rate of change of the value of the portfolio with respect to changes in the volatility 
of the underlying asset. Gamma and vega can be changed by trading options on the 
underlying asset. 

In practice, derivatives traders usually rebalance their portfolios at least once a day to 
maintain delta neutrality. It is usually not feasible to maintain gamma and vega neutrality 
on a regular basis. Typically a trader monitors these measures. If they get too large, either 
corrective action is taken or trading is curtailed. 


Further Reading 


Derman, E., D. Ergener, and I. Kani. “Static Options Replication.” Journal of Derivatives 2,no. 4 (Summer 
1995): 78-95. 

Passarelli, D. Trading Option Greeks: How Time Volatility and Other Factors Drive Profits. 2nd ed. Hoboken, 
NJ: John Wiley & Sons, 2012. 

Taleb, N. N. Dynamic Hedging: Managing Vanilla and Exotic Options. New York: John Wiley & Sons, 
1997. 


Practice Questions and Problems 
(Answers at End of Book) 


8.1 The delta of a derivatives portfolio dependent on an index is —2,100. The index 
is currently 1,000. Estimate what happens to the value of the portfolio when the 
index increases to 1,005. 

8.2 The vega of a derivatives portfolio dependent on the dollar—sterling exchange rate 
is 200 (per %). Estimate the effect on the portfolio of an increase in the volatility 
of the exchange rate from 12% to 14%. 

8.3 The gamma of a delta-neutral portfolio is 30. Estimate what happens to the value 
of the portfolio when the price of the underlying asset (a) suddenly increases by $2 
and (b) suddenly decreases by $2. 
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8.4 


8.5 


8.6 


8.7 


8.8 


8.9 


8.10 


8.15 


What does it mean to assert that the delta of a call option is 0.7? How can a short 
position in 1,000 options be made delta neutral when the delta of a long position 
in each option is 0.7? 
What does it mean to assert that the theta of an option position is —100 per day? If 
a trader feels that neither a stock price nor its implied volatility will change, what 
type of option position is appropriate? 
What is meant by the gamma of an option position? What are the risks in the 
situation where the gamma of a position is large and negative and the delta is zero? 
“The procedure for creating an option position synthetically is the reverse of the 
procedure for hedging the option position.” Explain this statement. 
A company uses delta hedging to hedge a portfolio of long positions in put and call 
options on a currency. Which of the following would lead to the most favorable 
result? 
(a) A virtually constant spot rate 
(b) Wild movements in the spot rate 

How does your answer change if the portfolio contains short option positions? 
A bank’s position in options on the dollar—euro exchange rate has a delta of 30,000 
and a gamma of —80,000. Explain how these numbers can be interpreted. The 
exchange rate (dollars per euro) is 0.90. What position would you take to make 
the position delta neutral? After a short period of time, the exchange rate moves to 
0.93. Estimate the new delta. What additional trade is necessary to keep the position 
delta neutral? Assuming the bank did set up a delta-neutral position originally, has 
it gained or lost money from the exchange-rate movement? 
“Static options replication assumes that the volatility of the underlying asset will 
be constant.” Explain this statement. 
Suppose that a trader using the static options replication technique wants to match 
the value of a portfolio of exotic derivatives with the value of a portfolio of regular 
options at 10 points on a boundary. How many regular options are likely to be 
needed? Explain your answer. 
Why is an Asian option easier to hedge than a regular option? 
Explain why there are economies of scale in hedging options. 
Consider a six-month American put option on a foreign currency when 
the exchange rate (domestic currency per foreign currency) is 0.75, the strike 
price is 0.74, the domestic risk-free rate is 5%, the foreign risk-free rate is 3%, 
and the exchange-rate volatility is 14% per annum. Use the RMFI software 
(binomial tree with 100 steps) to calculate the price, delta, gamma, vega, theta, and 
rho of the option. Verify that delta is correct by changing the exchange rate to 
0.751 and recomputing the option price. 


Further Questions 


The gamma and vega of a delta-neutral portfolio are 50 and 25, respectively, where 
the vega is “per %.” Estimate what happens to the value of the portfolio when there 
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8.16 


8.17 


8.18 


8.19 


is a shock to the market causing the underlying asset price to decrease by $3 and 
its volatility to increase by 4%. 

Consider a one-year European call option on a stock when the stock price is $30, 
the strike price is $30, the risk-free rate is 5%, and the volatility is 25% per annum. 
Use the RMFI software to calculate the price, delta, gamma, vega, theta, and rho 
of the option. Verify that delta is correct by changing the stock price to $30.1 and 
recomputing the option price. Verify that gamma is correct by recomputing the 
delta for the situation where the stock price is $30.1. Carry out similar calculations 
to verify that vega, theta, and rho are correct. 

A financial institution has the following portfolio of over-the-counter options on 
pounds sterling: 


Delta of Gamma of Vega of 


Type Position Option Option Option 
Call —1,000 0.50 2.2 1.8 
Call —500 0.80 0.6 0.2 
Put —2,000 —0.40 1.3 0.7 
Call —500 0.70 1.8 1.4 


A traded option is available with a delta of 0.6, a gamma of 1.5, and a vega of 0.8. 

(a) What position in the traded option and in pounds sterling would make the 
portfolio both gamma neutral and delta neutral? 

(b) What position in the traded option and in pounds sterling would make the 
portfolio both vega neutral and delta neutral? 

Consider again the situation in Problem 8.17. Suppose that a second traded option, 

with a delta of 0.1, a gamma of 0.5, and a vega of 0.6, is available. How could the 

portfolio be made delta, gamma, and vega neutral? 

Check the first three rows of Table 8.2 using the RMFI software. Calculate the 

gamma and theta of the position during the first three weeks, and the change in 

the value of the position (before the end-of-week rebalancing) during each of these 

weeks. Check whether equation (8.2) is approximately satisfied. (Note: The software 

produces a value of theta per calendar day. The theta in equation (8.2) is per year.) 


Chapter 9 


Interest Rate Risk 


nterest rate risk is more difficult to manage than the risk arising from market vari- 

ables such as equity prices, exchange rates, and commodity prices. One complication 

is that there are many different interest rates in any given currency (Treasury rates, 
interbank borrowing and lending rates, swap rates, and so on). Although these tend to 
move together, they are not perfectly correlated. Another complication is that we need 
more than a single number to describe the interest rate environment. We need a func- 
tion describing the variation of the interest rate with maturity. This is known as the 
term structure of interest rates or the yield curve. The interest rate considered is usually a 
zero-coupon interest rate, sometimes just referred to as a zero. It is the rate of interest that 
would apply for the maturity being considered if all interest and principal are paid at 
the end (i.e., there are no periodic payments such as those that are usually made on 
bonds). 

Consider, for example, the situation of a U.S. government bond trader. The trader’s 
portfolio is likely to consist of many bonds with different maturities. There is an exposure 
to movements in the one-year rate, the two-year rate, the three-year rate, and so on. The 
trader’s delta exposure is therefore more complicated than that of the gold trader in Table 
8.1. He or she must be concerned with all the different ways in which the U.S. Treasury 
yield curve can change its shape through time. 

This chapter starts with a description of traditional approaches used by a financial 
institution to manage interest rate risk. It explains some of the interest rates that are 
important to financial institutions. It then covers duration and convexity measures. These 
can be regarded as the interest rate equivalents of the delta and gamma measures con- 
sidered in the previous chapter. A number of different approaches to managing the risks 
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of nonparallel shifts are then presented. These include the use of partial durations, the 
calculation of multiple deltas, and the use of principal components analysis. 

This chapter does not cover issues such as the compounding frequency with which 
interest rates are measured and how the term structure of zero-coupon interest rates is 
calculated. These topics are covered in Appendices A and B. 


9.1 The Management of Net Interest Income 


A key risk management activity for a bank is the management of net interest income. As 
explained in Section 2.2, the net interest income is the excess of interest received over 
interest paid. It is the role of the asset-liability management function within the bank 
to ensure that the net interest margin, which is net interest income divided by income- 
producing assets, remains roughly constant through time. This section considers how this 
is done. 

How can fluctuations in net interest margin occur? Consider a simple situation where 
a bank offers consumers a one-year and a five-year deposit rate as well as a one-year and 
five-year mortgage rate. The rates are shown in Table 9.1. We make the simplifying 
assumption that the expected one-year interest rate for future time periods equals the 
one-year rate prevailing in the market today. Loosely speaking, this means that market 
participants consider interest rate increases to be just as likely as interest rate decreases. 
As a result, the rates in Table 9.1 are fair in that they reflect the market’s expectations. 
Investing money for one year and reinvesting for four further one-year periods leads to 
an uncertain return. But, given our assumptions, the expected overall return is the same 
as a single five-year investment. Similarly, borrowing money for one year and refinancing 
each year for the next four years leads to the same expected financing costs as a single 
five-year loan. 

Suppose you have money to deposit and agree with the prevailing view that interest 
rate increases are just as likely as interest rate decreases. Would you choose to deposit your 
money for one year at 3% per annum or for five years at 3% per annum? The chances 
are that you would choose one year because this gives you more financial flexibility. It 
ties up your funds for a shorter period of time. 

Now suppose that you want a mortgage. Again you agree with the prevailing view 
that interest rate increases are just as likely as interest rate decreases. Would you choose a 
one-year mortgage at 6% or a five-year mortgage at 6%? The chances are that you would 


Table 9.1 Example of Rates Offered by a Bank 
to Its Customers 


Maturity Deposit Mortgage 
(years) Rate Rate 
1 3% 6% 


3% 6% 
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choose a five-year mortgage because it fixes your borrowing rate for the next five years 
and subjects you to less refinancing risk. 

When the bank posts the rates shown in Table 9.1, it is likely to find that the majority 
of its depositors opt for a one-year maturity and the majority of the customers seeking 
mortgages opt for a five-year maturity. This creates an asset/liability mismatch for the 
bank and subjects its net interest income to risks. The deposits that are financing the 
five-year 6% mortgages are rolled over every year. There is no problem if interest rates 
fall. After one year, the bank will find itself financing the five-year 6% mortgages with 
deposits that cost less than 3% and net interest income will increase. However, if inter- 
est rates rise, the deposits that are financing the 6% mortgages will cost more than 3% 
and net interest income will decline. Suppose that there is a 3% rise in interest rates 
during the first two years. This would reduce net interest income for the third year 
to zero. 

It is the job of the asset-liability management group to ensure that this type of 
interest rate risk is minimized. One way of doing this is to ensure that the maturi- 
ties of the assets on which interest is earned and the maturities of the liabilities on 
which interest is paid are matched. In our example, the matching can be achieved by 
increasing the five-year rate on both deposits and mortgages. For example, the bank 
could move to the situation in Table 9.2 where the five-year deposit rate is 4% and 
the five-year mortgage rate is 7%. This would make five-year deposits relatively more 
attractive and one-year mortgages relatively more attractive. Some customers who chose 
one-year deposits when the rates were as in Table 9.1 will choose five-year deposits 
when rates are as in Table 9.2. Some customers who chose five-year mortgages when 
the rates were as in Table 9.1 will choose one-year mortgages. This may lead to the 
maturities of assets and liabilities being matched. If there is still an imbalance with depos- 
itors tending to choose a one-year maturity and borrowers a five-year maturity, five-year 
deposit and mortgage rates could be increased even further. Eventually the imbalance will 
disappear. 

The net result of all banks behaving in the way we have just described is that long- 
term rates tend to be higher than those predicted by expected future short-term rates. 
This phenomenon is referred to as liquidity preference theory. It leads to long-term rates 
being higher than short-term rates most of the time. Even when the market expects a 
small decline in short-term rates, liquidity preference theory is likely to cause long-term 
rates to be higher than short-term rates. Only when a steep decline in interest rates is 
expected will long-term rates be lower than short-term rates. 


Table 9.2 Five-Year Rates Are Increased in an 
Attempt to Match Maturities of Assets and Liabilities 


Maturity Deposit Mortgage 
(years) Rate Rate 
1 3% 6% 


4% 7% 


188 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


Many banks now have sophisticated systems for monitoring the decisions being made 
by customers so that, when they detect small differences between the maturities of the 
assets and liabilities being chosen, they can fine-tune the rates they offer. Often derivatives 
such as interest rate swaps are used to manage their exposures (see Example 5.1 in Section 
5.5.3). The result of all this is that net interest margin is usually stable. This has not 
always been the case. In the 1980s in the United States, the failures of savings and loans 
companies were largely a result of their failure to match maturities for assets and liabilities. 


9.1.1 Liquidity 


In addition to eroding net interest margin, a mismatch of assets and liabilities can lead to 
liquidity problems. A bank that funds long-term loans with short-term deposits has to 
replace maturing deposits with new deposits on a regular basis. (This is sometimes referred 
to as rolling over the deposits.) If depositors lose confidence in the bank, it might find it 
difficult to do this. A well-known example of a financial institution that failed because of 
liquidity problems is Northern Rock in the United Kingdom. It chose to finance much of 
its mortgage portfolio with wholesale deposits, some lasting only three months. Starting 
in September 2007, the depositors became nervous because of the problems surfacing in 
the United States. As a result, Northern Rock was unable to finance its assets and was 
taken over by the UK government in early 2008 (see Business Snapshot 24.1 for more 
details). In the United States, Bear Stearns and Lehman Brothers experienced similar 
problems in rolling over their wholesale deposits. 

Many of the problems during the credit crisis that started in 2007 were caused by 
a shortage of liquidity. As often happens during stressed market conditions, there was 
a flight to quality where investors looked for very safe investments and were not pre- 
pared to take credit risks. Bank regulators have now recognized the need to set liquidity 
requirements, as well as capital requirements, for banks. Chapter 16 explains the Basel HI 
liquidity requirements and Chapter 24 discusses liquidity issues in more detail. 


9.2 Types of Rates 


In this section, we explain a number of interest rates that are important to financial 
institutions. 


9.2.1 Treasury Rates 


Treasury rates are the rates an investor earns on Treasury bills and Treasury bonds. These 
are the instruments used by a government to borrow in its own currency. Japanese Trea- 
sury rates are the rates at which the Japanese government borrows in yen, U.S. Treasury 
rates are the rates at which the U.S. government borrows in U.S. dollars, and so on. It 
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is usually assumed that there is no chance that a government will default on an obliga- 
tion denominated in its own currency.' Treasury rates are therefore usually regarded as 
risk-free rates in the sense that an investor who buys a Treasury bill or Treasury bond is 
certain that interest and principal payments will be made as promised. 


9.2.2 LIBOR 


LIBOR is short for London interbank offered rate. It is an unsecured short-term borrow- 
ing rate between banks. LIBOR rates are quoted for a number of different curren- 
cies and borrowing periods. The borrowing periods range from one day to one year. 
LIBOR rates are used as reference rates for hundreds of trillions of dollars of trans- 
actions throughout the world. One popular and important derivative transaction that 
uses LIBOR as a reference interest rate is an interest rate swap (see Chapter 5). LIBOR 
rates are compiled by asking 18 global banks to provide quotes estimating the rate of 
interest at which they could borrow funds from other banks just prior to 11:00 a.m. 
(UK time). The highest four and lowest four of the quotes for each currency/borrowing 
period are discarded and the remaining ones are averaged to determine LIBOR fixings 
for the day. The banks submitting the quotes typically have an AA credit rating. LIBOR 
is therefore usually assumed to be an estimate of the unsecured borrowing rate for an 
AA-rated bank. 

Some traders working for banks have been investigated for attempting to manipulate 
LIBOR quotes. Why might they do this? Suppose that the payoff to a bank from a 
derivative depends on the LIBOR fixing on a particular day, with the payoff increasing 
as the fixing increases. It is tempting for a trader to provide a high quote on that day and to 
try to persuade other banks to do the same. Tom Hayes was the first trader to be convicted 
of LIBOR manipulation. In August 2015, he was sentenced to 14 years (later reduced to 
11 years) in prison by a court in the United Kingdom. A problem with the system was 
that there was not enough interbank borrowing for banks to make accurate estimates 
of their borrowing rates for all the quotes that were required, and some judgment was 
inevitably necessary. In an attempt to improve things, the number of different currencies 
has been reduced from 10 to 5 and the number of different borrowing periods has been 
reduced from 15 to 7. Also, regulatory oversight of the way the borrowing estimates are 
produced has been improved. 

It is now recognized that LIBOR is a less-than-ideal reference rate for derivatives 
transactions because it is determined from estimates made by banks, not from actual 
market transactions. The derivatives market is investigating the use of other reference 
rates such as OIS rates (which will be discussed shortly). 


'This is because a government controls the money supply in its own currency (i.e., it can print its 
own money). But governments are liable to default on debt in a foreign currency. Also, countries in 
the European Union are liable to default on debt denominated in euros. 
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9.2.3 The LIBOR/Swap Zero Curve 


As mentioned, LIBOR quotes last between one day and one year. They therefore define 
in a direct way the zero-coupon LIBOR yield curve for maturities up to one year. 
How can the LIBOR yield curve be extended beyond one year? There are two possible 
approaches: 


1. Create a yield curve to represent the rates at which AA-rated companies can today 
borrow funds for periods of time longer than one year. 

2. Create a yield curve to represent the future short-term borrowing rates for AA-rated 
companies. 


It is important to understand the difference. Suppose that the yield curve is 4% for all 
maturities. If the yield curve is created in the first way, this means that companies rated 
AA today can lock in an interest rate of 4% regardless of how long they want to borrow. 
If the yield curve is created in the second way, the forward interest rate that the market 
assigns to the short-term borrowing rates of companies that will be rated AA at a future 
time is 4%. (See Appendix B for how forward rates are defined and calculated.) When 
the yield curve is created in the first way, it gives the forward short-term borrowing rate 
for a company that is AA-rated today. When it is created in the second way, it gives the 
forward short-term borrowing rate for a company that will be AA at the beginning of 
the period covered by the forward contract. 

In practice, the LIBOR yield curve is extended using the second approach. Swap rates 
(see Table 5.5) are used to extend the LIBOR yield curve, as described in Appendix B? 
The resulting yield curve is sometimes called the LIBOR yield curve, sometimes the 
swap yield curve, and sometimes the LIBOR/swap yield curve. To understand why swap 
rates can be used to extend the LIBOR yield curve when the second approach is used, 
note that a bank can convert a series of short-term LIBOR loans to a swap rate using 
the swap market. For example, it can 


1. Lend a certain principal for six months to an AA borrower and relend it for nine 
successive six-month periods to (possibly different) borrowers who are rated AA at 
the time of their loans; and 

2. Enter into a swap to exchange the LIBOR for the five-year swap rate. 


This means that the swap rate represents what the bank can expect to earn from a series 
of short-term loans to AA-rated borrowers at LIBOR. It is sometimes referred to as a 
continually refreshed rate? 


? Eurodollar futures, which are contracts on the future value of LIBOR, can also be used to extend 
the LIBOR yield curve. 

3See P Collin-Dufresne and B. Solnik, “On the Term Structure of Default Premia in the Swap and 
Libor Market,” Journal of Finance 56, no. 3 (June 2001): 1095-1115. 
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9.2.4 LIBOR vs. Treasury Rates 


Risk-free rates are important in the pricing of financial contracts. Treasury rates might 
be thought to be natural rates to use as risk-free rates, but in practice they are regarded 
as artificially low because: 


1. The amount of capital a bank is required to hold to support an investment in Treasury 
bills and bonds (typically zero) is substantially smaller than the capital required to 
support a similar investment in other very-low-risk instruments. 

2. In the United States, Treasury instruments are given a favorable tax treatment com- 
pared with most other fixed-income investments because they are not taxed at the 
state level. 


Prior to the credit crisis that started in 2007, financial institutions used LIBOR and 
swap rates as a proxies for risk-free rates. Since the crisis, they have switched to using 
overnight indexed swap (OIS) rates for this purpose. We now explain how OIS rates are 
determined. 


9.2.5 The OIS Rate 


An overnight indexed swap (OIS) is a swap where a fixed interest rate for a period 
(e.g., one month, three months, one year, or two years) is exchanged for the geometric 
average of overnight rates during the period.* The relevant overnight rates are the rates 
in the government-organized interbank market where banks with excess reserves lend 
to banks that need to borrow to meet their reserve requirements.” In the United States, 
the overnight borrowing rate in this market is known as the fed funds rate. The effective 
fed funds rate on a particular day is the weighted average of the overnight rates paid by 
borrowing banks to lending banks on that day. This is what is used in the OIS geometric 
average calculations. Many other countries have similar overnight markets. For example, 
the Eonia (Euro OverNight Index Average) is the European equivalent of the effective 
fed funds rate; the SONIA (Sterling OverNight Index Average) is the British equivalent; 
and so on. 

If during a certain period a bank borrows at the overnight rate (rolling the loan and 
interest forward each day), it pays the geometric average of the overnight interest rates 
for the period. Similarly, if it lends at the overnight rate every day, it receives the geo- 
metric average of the overnight rates for the period. An OIS therefore allows overnight 
borrowing or lending to be swapped for borrowing or lending at a fixed rate for a period 
of time. The fixed rate is referred to as the OIS rate. 


‘The term “geometric average of overnight rates” should here be interpreted as “geometric average 
of one plus the overnight rates minus one.” (See Business Snapshot 4.1.) 

5 Central banks require commercial banks to keep a certain percentage of customer deposits as reserves 
that cannot be lent out. The reserves can take the form of cash or deposits with the central bank. 
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Figure 9.1 The Three-Month LIBOR-OIS Spread, January 2002 to June 2017 in Basis Points 


A key indicator of stress in the banking system is the LIBOR-OIS spread. This is the 
amount by which the three-month London Interbank Offered Rate (LIBOR) exceeds 
the three-month overnight indexed swap (OIS) rate. As discussed, the former is the rate 
of interest at which a bank will extend unsecured credit to an AA-rated bank for a term 
of three months. The latter is the rate of interest at which funds can be borrowed by a 
bank for three months by using overnight borrowings at the fed funds rate of interest 
in conjunction with a swap that converts the overnight borrowing to three-month bor- 
rowing. Banks can in theory borrow at the three-month OIS rate and lend the funds 
to an AA-rated bank at the three-month LIBOR rate of interest. Assuming no risk in 
overnight borrowing and no risk in the OIS swap (e.g., because it is well collateralized), 
the LIBOR-OIS spread is therefore a credit spread that compensates lenders for the 
possibility that an AA-rated bank might default during a three-month period. In nor- 
mal market conditions, the LIBOR-OIS spread is less than 10 basis points (annualized). 
The larger the LIBOR-OIS spread, the greater the reluctance of banks to lend to each 
other. 

Figure 9.1 shows the LIBOR-OIS spread between January 2002 and June 2017. Prior 
to August 2007, the LIBOR-OIS spread was less than 10 basis points. In August 2007, 
as problems in the U.S. housing market became apparent and banks became increasingly 
reluctant to lend to each other, it started to increase. It reached a peak of 364 basis points 
in early October 2008. By a year later, it had returned to more normal levels. 


9.2.6 Repo Rates 


Unlike LIBOR and federal funds rates, repo rates are secured borrowing rates. In a repo 
(or repurchase agreement), a financial institution that owns securities agrees to sell the 
securities for a certain price and to buy them back at a later time for a slightly higher 
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price. The financial institution is obtaining a loan, and the interest it pays is the differ- 
ence between the price at which the securities are sold and the price at which they are 
repurchased. The interest rate is referred to as the repo rate. 

If structured carefully, a repo involves very little credit risk. If the borrower does not 
honor the agreement, the lending company simply keeps the securities. If the lending 
company does not keep to its side of the agreement, the original owner of the securities 
keeps the cash provided by the lending company. The most common type of repo is an 
overnight repo, which may be rolled over day to day. However, longer-term arrangements, 
known as term repos, are sometimes used. Because they are secured rates, a repo rate is 
generally a few basis points below the corresponding LIBOR or fed funds rate. 


9.3 Duration 


Duration is a widely used measure of a portfolio’s exposure to yield curve movements. 
Suppose y is a bond’s yield and B is its market price. The duration D of the bond is 
defined as 


D= _14B (9.1) 
B Ay 
so that 
AB = —DBAy 


where Ay is a small change in the bond’s yield and AB is the corresponding change in 
its price. Duration measures the sensitivity of percentage changes in the bond’s price to 
changes in its yield. Using calculus notation, we can write 


1 dB 
D=--— (9.2) 
B dy 
Consider a bond that provides cash flows ¢,, &,...,¢, at times t, f,...,¢,. (The cash 


flows consist of the coupon and principal payments on the bond.) The bond yield, y, is 
defined as the discount rate that equates the bond’s theoretical price to its market price. 
We denote the present value of the cash flow c,, discounted from time t; to today at rate 
y, by v; so that the price of the bond is 


n 


B= >, 


i=1 
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An alternative definition of duration is 


G= D(z) (9.3) 


The term in parentheses in equation (9.3) is the ratio of the present value of the 
cash flow at time t; to the bond price. Equation (9.3) therefore defines duration as a 
weighted average of the times when payments are made, with the weight applied to time 
t; being equal to the proportion of the bond’s total present value provided by the cash 
flow at time t;. (The sum of the weights is 1.0.) This definition explains where the term 
duration comes from. Duration is a measure of how long the bondholder has to wait 
for cash flows. A zero-coupon bond that lasts n years has a duration of n years. However, 
a coupon-bearing bond lasting n years has a duration of less than n years, because the 
holder receives some of the cash payments prior to year n. 

If the bond’s yield, y, in equation (9.1) is measured with continuous compounding, it 
turns out that the definitions of duration in equations (9.1) and (9.3) are the same. (See 
Problem 9.15.) 

Consider a three-year 10% coupon bond with a face value of $100. Suppose that 
the yield on the bond is 12% per annum with continuous compounding. This means 
that y = 0.12. Coupon payments of $5 are made every six months. Table 9.3 shows the 
calculations necessary to determine the bond’s duration. The present values of the bond’s 
cash flows, using the yield as the discount rate, are shown in column 3. (For example, 
the present value of the first cash flow is 5e~°:!?*°° = 4.709.) The sum of the numbers 
in column 3 is the bond’s market price, 94.213. The weights are calculated by dividing 
the numbers in column 3 by 94.213. The sum of the numbers in column 5 gives the 
duration as 2.653 years. 

Small changes in interest rates are often measured in basis points. A basis point is 
0.01% per annum. The following example shows that equation (9.1) is correct when 
duration is defined as in equation (9.3) and yields are measured with continuous 
compounding. 


Table 9.3 Calculation of Duration 
Time Cash Flow 


(years) ($) Present Value Weight Time x Weight 
0.5 5 4.709 0.050 0.025 
1.0 5 4.435 0.047 0.047 
1.5 5 4.176 0.044 0.066 
2.0 5 3.933 0.042 0.083 
2.5 5 3.704 0.039 0.098 
3.0 105 73.256 0.778 2.333 


Total 130 94.213 1.000 2.653 
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Example 9.1 


For the bond in Table 9.3, the bond price, B, is 94.213 and the duration, D, is 2.653, so 
that equation (9.1) gives 


AB = —94.213 x 2.653Ay 
or 
AB = —249.95Ay 


When the yield on the bond increases by 10 basis points (= 0.1%), Ay = +0.001. The 
duration relationship predicts that AB = —249.95 x 0.001 = —0.250 so that the bond 
price goes down to 94.213 — 0.250 = 93.963. How accurate is this? When the bond 
yield increases by 10 basis points to 12.1%, the bond price is 


—0. A —0. i —0. .5 —0. i 
5e 0.121x0.5 4 5e 0.121x1.0 4 5e ).121x1.5 4 5e 0.121X2.0 


4570-12125 4. 105e70121x3.0 — 93 963 


which is (to three decimal places) the same as that predicted by the duration relationship. 


9.3.1 Modified Duration 


The definition of duration in equation (9.3) was suggested by Frederick Macaulay in 
1938. It is referred to as Macaulay’s duration. As mentioned, when the yield y on the 
bond is measured with continuous compounding, it is equivalent to the definition in 
equation (9.1). When duration is defined using equations (9.1) and other compounding 
frequencies are used for y, a small adjustment is necessary to Macaulay’s duration. When 
y 1s measured with annual compounding, it can be shown that the expression for D 
in equation (9.3) must be divided by 1 + y. More generally, when y is expressed with a 
compounding frequency of m times per year, it must be divided by 1 + y/m. (See Problem 
9.15.) Duration defined with these adjustments to equation (9.3) is referred to as modified 
duration. 


Example 9.2 


The bond in Table 9.3 has a price of 94.213 and a duration of 2.653. The yield, expressed 
with semiannual compounding, 1s 12.3673%. (See Appendix A.) The (modified) duration 
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appropriate for calculating sensitivity to the yield when it is expressed with semiannual 
compounding is 


2.653 


———— = 2.4985 
1 + 0.123673/2 


From equation (9.1), 
AB = —94.213 x 2.4985Ay 
or 
AB = —235.39Ay 


When the yield (semiannually compounded) increases by 10 basis points (= 0.1%), Ay = 
+0.001. The duration relationship predicts that we expect AB to be —235.39 x 0.001 = 
—0.235 so that the bond price goes down to 94.213 — 0.235 = 93.978. How accurate is 
this? When the bond yield (semiannually compounded) increases by 10 basis points to 
12.4673% (or to 12.0941% with continuous compounding), an exact calculation similar 
to that in the previous example shows that the bond price becomes 93.978. This shows 
that the modified duration is accurate for small yield changes. 


9.3.2 Dollar Duration 


The dollar duration of a bond is defined as the product of its duration and its price. If 
Dy is dollar duration, it follows from equation (9.1) that 


AB = —D,Ay 


or using calculus notation 


Whereas duration relates proportional changes in a bond’s price to its yield, dollar 
duration relates actual changes in the bond’s price to its yield. Dollar duration is similar 
to the delta measure discussed in Chapter 8. 


9.4 Convexity 


The duration relationship measures exposure to small changes in yields. This is illustrated 
in Figure 9.2, which shows the relationship between the percentage change in value and 
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Figure 9.2 Two Bonds with the Same Duration and Different Convexities 


change in yield for two bonds with the same duration. The gradients of the two curves 
are the same at the origin. This means that both portfolios change in value by the same 
percentage for small yield changes, as predicted by equation (9.1). For large yield changes, 
the bonds behave differently. Bond X has more curvature in its relationship with yields 
than bond Y. A factor known as convexity measures this curvature and can be used to 
improve the relationship between bond prices and yields. 

The convexity for a bond is 


—yt, 


afn din GO 
B dy? B 


where y is the bond’s yield measured with continuous compounding. This is the weighted 
average of the square of the time to the receipt of cash flows. From Appendix G, a second 
order approximation to the change in the bond price is 


This leads to 


AB _ —~DAy+ SctAyy? (9.4) 
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Example 9.3 


Consider again the bond in Table 9.3. The bond price, B, is 94.213, and the duration, D, 
is 2.653. The convexity is 


0.05 x 0.57 + 0.047 x 1.07 + 0.044 x 1.57 + 0.042 x 2.07 + 0.039 x 2.57 
+ 0.779 x 3.07 = 7.570 


The convexity relationship in equation (9.4) is therefore 


Az = —2.653Ay + 5 X 7.570 x (Ay)? 


Consider a 2% change in the bond yield from 12% to 14%. The duration relationship 
predicts that the dollar change in the value of the bond will be —94.213 x 2.653 x 0.02 = 
—4.999. The convexity relationship predicts that it will be 


—94.213 x 2.653 X 0.02 + 0.5 X 94.213 x 7.570 X 0.027 = —4.856 


The actual change in the value of the bond is —4.859. This shows that the convexity 
relationship gives much more accurate results than duration for a large change in the 


bond yield. 


9.4.1 Dollar Convexity 


The dollar convexity of a bond, Cs, can be defined analogously to dollar duration as the 
product of convexity and the value of the bond. This means that 


and shows that dollar convexity is similar to the gamma measure introduced in Chapter 8. 


9.5 Generalization 


So far we have used duration and convexity to measure the sensitivity of the price of 
a single bond to interest rates. The definitions of duration and convexity can be gener- 
alized so that they apply to a portfolio of bonds—or to any portfolio of interest-rate- 
dependent instruments. We define a parallel shift in the zero-coupon yield curve as a 
shift where all zero-coupon interest rates change by the same amount, as indicated in 
Figure 9.3. 
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Figure 9.3 A Parallel Shift in Zero Rates 


Suppose that P is the value of the portfolio of interest-rate-dependent securities. We 
can make a small parallel shift in the zero-coupon yield curve and observe the change 
AP in P. Duration is defined as 


D=-—— (9.5) 


where Ay is the size of the small parallel shift. Equation (9.5) is equivalent to 


AP 
— = -DA 9.6 
P Y (9.6) 


Suppose a portfolio consists of a number of interest-rate-dependent assets. The ith 
asset is worth X; and has a duration D; (i = 1,2, ..., n). Define AX; as the change in 
the value of X; arising from the yield curve shift Ay. It follows that P =)", X; and 
AP = ¥'"_, AX; so that from equation (9.5) the duration of the portfolio is given by 


1 n AX, 
D=- Y — 
Pm Ay 
The duration of the ith asset is 
1 AX; 
© X, Ay 
Hence 
n X. 
p=Y—pD 
Zig 


® A small parallel shift of Ay in the zero-coupon yield curve leads to the yield of all bonds changing 
by approximately Ay. 
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This shows that the duration D of a portfolio is the weighted average of the durations of 
the individual assets comprising the portfolio with the weight assigned to an asset being 
proportional to the value of the asset. 

The dollar duration Dg of a portfolio can be defined as duration of the portfolio 
times the value of the portfolio: 


This is a measure of the delta of the portfolio with respect to interest rates. The dollar 
duration of a portfolio consisting of a number of interest-rate-dependent assets is the 
sum of the dollar durations of the individual assets. 

The convexity measure can be generalized in the same way as duration. For any 
interest-rate-dependent portfolio whose value is P we define the convexity C as 1/P 
times the second partial derivative of the value of the portfolio with respect to a parallel 
shift in the zero-coupon yield curve. Equation (9.4) is correct with B replaced by P: 


AP 1 2 
— = -—DAy+ =C(Ay)* 9.7 
> y+ 5 Clay) 9.7) 


The relationship between the convexity of a portfolio and the convexity of the assets 
comprising the portfolio is similar to that for duration: the convexity of the portfolio is 
the weighted average of the convexities of the assets with the weights being proportional 
to the value of the assets. For a portfolio with a particular duration, the convexity tends 
to be greatest when the portfolio provides payments evenly over a long period of time. 
It is least when the payments are concentrated around one particular point in time. 

The dollar convexity for a portfolio worth P can be defined as P times the convexity. 
This a measure of the gamma of the portfolio with respect to interest rates. The dollar 
convexity of a portfolio consisting of a number of interest-rate-dependent positions is 
the sum of the dollar convexities of the individual assets. 


9.5.1 Portfolio Immunization 


A portfolio consisting of long and short positions in interest-rate-dependent assets can 
be protected against relatively small parallel shifts in the yield curve by ensuring that its 
duration is zero. It can be protected against relatively large parallel shifts in the yield curve 
by ensuring that its duration and convexity are both zero or close to zero. 


9.6 Nonparallel Yield Curve Shifts 


Unfortunately the basic duration relationship in equation (9.6) only quantifies exposure 
to parallel yield curve shifts. The duration plus convexity relationship in equation (9.7) 
allows the shift to be relatively large, but it is still a parallel shift. 
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Table 9.4 Zero-Coupon Yield Curve (rates 
continuously compounded) 


Maturity (years) 1 2 3 4 5 7 10 


Rate (%) 40 45 48 5.0 5.1 5.2 5.3 
67 Zero rate (%) 
5- 
4 
34 
24 
14 
Maturity (years) 
wo a a s a”; o d 


Figure 9.4 The Zero-Coupon Yield Curve (as shown in Table 9.4) 


9.6.1 Partial Duration 


Some researchers have extended duration measures so that nonparallel shifts can be con- 
sidered. Reitano (1992) suggests a partial duration measure where just one point on the 
zero-coupon yield curve is shifted and all other points remain the same” Suppose that the 
zero curve is as shown in Table 9.4 and Figure 9.4. Shifting the five-year point involves 
changing the zero curve as indicated in Figure 9.5. In general, the partial duration of the 
portfolio for the ith point on the zero curve is 

1 AP, 


i P Ay; 


where Ay; is the size of the small change made to the ith point on the yield curve and 
AP, is the resulting change in the portfolio value. The sum of all the partial duration 
measures equals the usual duration measure The percentage change in the portfolio 
value arising from Ay; is —D;Ay;. 

Suppose that the partial durations for a particular portfolio are as shown in Table 
9.5. The duration of the portfolio (sum of the partial durations) is only 0.2. This means 
that the portfolio is relatively insensitive to parallel shifts in the yield curve. However, the 
durations for short maturities are positive while those for long maturities are negative. 


7 See R. Reitano, “Nonparallel Yield Curve Shifts and Immunization,” Journal of Portfolio Management 
(Spring 1992): 36—43. 

8 When the ith point on the zero curve is shifted, the other points are not shifted and rates on the 
shifted yield curve are calculated using linear interpolation as indicated in Figure 9.5. 
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Figure 9.5 Change in Zero-Coupon Yield Curve When One Point Is Shifted 


Table 9.5 Partial Durations for a Portfolio 


Maturity (years) 1 2 3 4 5 7 10 Total 
Duration 0.2 06 0.9 16 2.0 —2.1 -3.0 0.2 


This means that the portfolio loses (gains) in value when short rates rise (fall). It gains 
(loses) in value when long rates rise (fall). 

We are now in a position to go one step further and calculate the sensitivity of a 
portfolio value to any nonparallel shifts. Suppose that, in the case of the yield curve 
shown in Figure 9.4, we define a rotation where the changes to the 1-year, 2-year, 3- 
year, 4-year, 5-year, 7-year, and 10-year are —3e, —2e, —e, 0, e, 3e, and 6e for some small e. 
This is illustrated in Figure 9.6. From the partial durations in Table 9.5, the percentage 
change in the value of the portfolio arising from the rotation is 


—[0.2 x (—3e) + 0.6 X (—2e) + 0.9 X (—e) + 1.6 X 0 + 2.0 X e 
—2.1 x 3e — 3.0 x 6e] = 25.0e 


| Zero rate (%) 


Maturity (years) 
0 T T T T T 1 
0 2 4 6 8 10 12 


Figure 9.6 A Rotation of the Yield Curve 
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Figure 9.7 Change Considered to Yield Curve When Bucketing Approach Is Used 


For a parallel shift of e in the yield curve, the percentage change in the value of the 
portfolio is —0.2e. This shows that a portfolio that gives rise to the partial durations in 
Table 9.5 is much more heavily exposed to a rotation of the yield curve than to a parallel 
shift. 

In Section 9.5 we explained how the usual duration measure can be converted to a 
dollar duration measure by multiplying by the value of the portfolio. The same is true of 
partial duration measures. The result is deltas describing the rate of change in the value 
of the portfolio with respect to a small change in one vertex of the term structure. 


9.6.2 Bucket Deltas 


A variation on the partial duration approach is to divide the yield curve into a number of 
segments or buckets and calculate for each bucket the dollar impact of changing all the 
zero rates corresponding to the bucket by one basis point while keeping all other zero 
rates unchanged. This approach is often used in asset-liability management (see Section 
9.1) and is referred to as GAP management. Figure 9.7 shows the type of change that would 
be considered for the segment of the zero curve between 2.0 and 3.0 years in Figure 9.4. 
The sum of the deltas for all the segments equals the dollar impact of a one-basis-point 
parallel shift. This is known as the DV01. 


9.6.3 Calculating Deltas to Facilitate Hedging 


One of the problems with the exposure measures that we have considered so far is that 
they are not designed to make hedging easy. Consider the partial durations in Table 9.5. If 
we plan to hedge our portfolio with zero-coupon bonds, we can calculate the position in 
a one-year zero-coupon bond to zero out the exposure to the one-year rate, the position 
in a two-year zero-coupon bond to zero out the exposure to the two-year rate, and so 
on. But if other instruments such as swaps or coupon-bearing bonds are used, a much 
more complicated analysis is necessary. 
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Often traders tend to use positions in the instruments that have been used to con- 
struct the zero curve to hedge their exposure. For example, a government bond trader is 
likely to take positions in the actively traded government bonds that were used to con- 
struct the Treasury zero curve when hedging. A trader of instruments dependent on the 
LIBOR/swap yield curve is likely take a position in LIBOR deposits, Eurodollar futures, 
and swaps when hedging. 

To facilitate hedging, traders therefore often calculate the impact of small changes in 
the quotes for each of the instruments used to construct the zero curve. The quote for the 
instrument is changed by a small amount, the zero-coupon yield curve is recomputed, 
and the portfolio is revalued. Consider a trader responsible for interest rate caps and swap 
options. Suppose that, when there is a one-basis-point change in a Eurodollar futures 
quote, the portfolio value increases by $500. Each Eurodollar futures contract changes 
in value by $25 for a one-basis-point change in the Eurodollar futures quote. It follows 
that the trader’s exposure can be hedged with 20 contracts. Suppose that the exposure to 
a one-basis-point change in the five-year swap rate is $4,000 and that a five-year swap 
with a notional principal of $1 million changes in value by $400 for a one-basis-point 
change in the five-year swap rate. The exposure can be hedged by trading swaps with a 
notional principal of $10 million. 


9.7 Principal Components Analysis 


The approaches we have just outlined can lead to analysts calculating 10 to 15 different 
deltas for every zero curve. This seems like overkill because the variables being considered 
are quite highly correlated with each other. For example, when the yield on a five-year 
bond moves up by a few basis points, most of the time the yield on a ten-year bond 
moves in a similar way. Arguably a trader should not be worried when a portfolio has a 
large positive exposure to the five-year rate and a similar large negative exposure to the 
ten-year rate. 

One approach to handling the risk arising from groups of highly correlated mar- 
ket variables is principal components analysis. This is a standard statistical tool with 
many applications in risk management. It takes historical data on daily changes in the 
market variables and attempts to define a set of components or factors that explain the 
movements. 

The approach is best illustrated with an example. The market variables we will con- 
sider are swap rates with maturities of 1 year, 2 years, 3 years, 4 years, 5 years, 7 years, 10 
years, and 30 years. Tables 9.6 and 9.7 show results produced for these market variables 
using 2,780 daily observations between 2000 and 2011. The first column in Table 9.6 
shows the maturities of the rates that were considered. The remaining eight columns in 
the table show the eight factors (or principal components) describing the rate moves. 
The first factor, shown in the column labeled PC1, corresponds to a roughly parallel 
shift in the yield curve. When we have one unit of that factor, the one-year rate increases 
by 0.216 basis points, the two-year rate increases by 0.331 basis points, and so on. The 
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Table 9.6 Factor Loadings for Swap Data 
PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 


1-year 0.216 —0.501 0.627 —0.487 0.122 0.237 0.011 —0.034 
2-year 0.331 —0.429 0.129 0.354 —0.212 —0.674 —0.100 0.236 
3-year 0.372 —0.267 —0.157 0.414 —0.096 0.311 0.413 —0.564 
4-year 0.392 —0.110 —0.256 0.174 —0.019 0.551 —0.416 0.512 
5-year 0.404 0.019 —0.355 —0.269 0.595 —0.278 —0.316 —0.327 
7-year 0.394 0.194 —0.195 —0.336 0.007 —0.100 0.685 0.422 
10-year 0.376 0.371 0.068 —0.305 —0.684 —0.039 —0.278 —0.279 
30-year 0.305 0.554 0.575 0.398 0.331 0.022 0.007 0.032 


Table 9.7 Standard Deviation of Factor Scores 
PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 


17.55 4.77 2.08 1.29 0.91 0.73 0.56 0.53 


second factor is shown in the column labeled PC2. It corresponds to a rotation or change 
of slope of the yield curve. Rates between 1 year and 4 years move in one direction while 
rates between 5 years and 30 years move in the other direction. A third factor corresponds 
to a “bowing” of the yield curve. Relatively short (1-year and 2-year) and relatively long 
(10-year and 30-year) rates move in one direction; the intermediate rates move in the 
other direction. An interest rate move for a particular factor is known as the factor loading. 
In our example, the first factor’s loading for the one-year rate is 0.216? 

Because there are eight rates and eight factors, the interest rate changes observed on 
any given day can always be expressed as a linear sum of the factors by solving a set of 
eight simultaneous equations. When this is done, the quantity of a particular factor in 
the interest rate changes on a particular day is known as the factor score for that day. 

The importance of a factor is measured by the standard deviation of its factor score. 
The standard deviations of the factor scores in our example are shown in Table 9.7 and 
the factors are listed in order of their importance. In carrying out the analysis, interest 
rate movements were measured in basis points. A quantity of the first factor equal to 
one standard deviation, therefore, corresponds to the one-year rate moving by 0.216 x 
17.55 = 3.78 basis points, the two-year rate moving by 0.331 x 17.55 = 5.81 basis points, 
and so on. 

Software for carrying out the calculations underlying Tables 9.6 and 9.7 is on the 
author’s website. The calculations are explained in Appendix I at the end of the book. 
To implement principal components analysis, it is first necessary to calculate a variance- 
covariance matrix from the observations (see Chapter 14 for a discussion of variance- 
covariance matrices). In our example, the variance-covariance matrix is a matrix with 
eight rows and eight columns with the first element of the first row being the variance 


? The factor loadings have the property that the sum of their squares for each factor is 1.0. Also, note 
that a factor is not changed if the signs of all its factor loadings are reversed. 
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Figure 9.8 The Three Most Important Factors Driving Movements in Swap Rates 


of the daily changes in the one-year rate, the second element of the first row being 
the covariance between the daily changes in the one-year rate and the daily changes in 
the two-year rate, and so on. The factor loadings are the eigenvectors calculated from 
this matrix and the variance of the factor scores are the eigenvalues calculated from the 
matrix. (Eigenvectors and eigenvalues are explained in Appendix H.) 

The factors have the property that the factor scores are uncorrelated across the data. 
For instance, in our example, the first factor score (amount of parallel shift) is uncorrelated 
with the second factor score (amount of twist) across the 2,780 days. The variances of 
the factor scores have the property that they add up to the total variance of the data. 
From Table 9.7, the total variance of the original data (that is, sum of the variance of the 
observations on the one-year rate, the variance of the observations on the two-year rate, 
and so on) is 


17.55? + 4.777 + 2.087 + ... + 0.537 = 338.8 


From this it can be seen that the first factor accounts for 17.557/338.8 = 90.9% of the 
variance in the original data; the first two factors account for 


(17.55% + 4.777) /338.8 = 97.7% 


of the variance in the data; the third factor accounts for a further 1.3% of the variance. 
This shows that most of the risk in interest rate moves is accounted for by the first two 
or three factors. It suggests that we can relate the risks in a portfolio of interest-rate- 
dependent instruments to movements in these factors instead of considering all eight 
interest rates. The three most important factors from Table 9.6 are plotted in Figure 9.8.!° 


10 Results similar to those described here, concerning the nature of the factors and the amount of the 
total risk they account for, are obtained when a principal components analysis is used to explain the 
movements in almost any yield curve in any country. 
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Table 9.8 Change in Portfolio Value for a 1-Basis-Point Rate Move 
($ millions) 


3-Year 4-Year 5-Year 7-Year 10-Year 
Rate Rate Rate Rate Rate 
+10 +4 —8 —7 +2 


9.7.1 Using Principal Components Analysis to Calculate Deltas 


To illustrate how a principal components analysis can provide an alternative way of cal- 
culating deltas, suppose we have a portfolio with the exposures to interest rate moves 
shown in Table 9.8. A one-basis-point change in the three-year rate causes the portfolio 
value to increase by $10 million; a one-basis-point change in the four-year rate causes it 
to increase by $4 million; and so on. We use the first two factors to model rate moves. 
(As mentioned earlier, this captures over 97% of the variance in rate moves.) Using the 
data in Table 9.6, our delta exposure to the first factor (measured in millions of dollars 
per unit of the factor with the factor loadings being assumed to be in basis points) is 


10 X 0.372 + 4 x 0.392 — 8 x 0.404 — 7 x 0.394 + 2 x 0.376 = +0.05 
and our delta exposure to the second factor is 
10 x (—0.267) + 4 x (—0.110) — 8 x 0.019 — 7 x 0.194 + 2 x 0.371 = —3.88 


The approach being used here is similar to the approach described in Section 9.6 
where partial durations were used to estimate the impact of a particular type of shift 
in the yield curve. The advantage of using a principal components analysis is that it tells 
you which are the most appropriate shifts to consider. It also provides information on the 
relative importance of different shifts. In the example we have considered, the exposure 
to the second shift is almost 80 times greater than our exposure to the first shift. However, 
from Table 9.7, the standard deviation first shift is about 3.7 times as great as the standard 
deviation of the second shift. A measure of the importance of a factor for a particular 
portfolio is the product of the delta exposure and the standard deviation of the factor 
score. Using this measure, the second factor is over 20 times as important as the first 
factor for the portfolio in Table 9.8. 


9.8 Gamma and Vega 


When several delta measures are calculated for interest rates, there are many possible 
gamma measures. Suppose that 10 instruments are used to compute the zero curve and 
that we measure deltas with respect to changes in the quotes for each of these. Gamma 
is a second partial derivative of the form 07P/ dx,0x; where P is the portfolio value. We 
have 10 choices for x; and 10 choices for x, and a total of 55 different gamma measures. 
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This may be information overload. One approach is to ignore cross-gammas and focus 
on the 10 partial derivatives where i = j. Another is to calculate a single gamma measure 
as the second partial derivative of the value of the portfolio with respect to a parallel shift 
in the zero curve. (This is dollar convexity.) A further possibility is to calculate gammas 
with respect to the first two factors in a principal components analysis. 

The vega of a portfolio of interest rate derivatives measures its exposure to volatility 
changes. Different volatilities are used to price different interest rate derivatives. One 
approach is to make the same small change to all volatilities and calculate the effect on 
the value of the portfolio. Another is to carry out a principal components analysis to 
calculate factors that reflect the patterns of volatility changes across different instruments 
(caps, swaptions, bond options, etc.) that are traded. Vega measures can then be calculated 
for the first two or three factors. 


Summary 


A bank’s net interest margin is a measure of the excess of the interest rate it earns over the 
interest rate it pays. There are now well-established asset/liability management procedures 
to ensure that this remains roughly constant from year to year. 

LIBOR is an important interest rate that governs the rates paid on many floating- 
rate loans throughout the world. The LIBOR rate is a short-term borrowing rate for 
AA-rated financial institutions. A complete LIBOR term structure of interest rates is 
constructed from LIBOR rates, Eurodollar futures, and swap rates. Forward interest rates 
calculated from this term structure are the forward borrowing rates for companies that 
will be AA-rated at the beginning of the period covered by the forward contract—not 
companies that are AA-rated today. The LIBOR/swap term structure of interest rates 
has traditionally been used as a proxy for the term structure of risk-free interest rates. 
Overnight indexed swap rates are now used as risk-free discount rates. 

An important concept in interest rate markets is duration. Duration measures the 
sensitivity of the value of a portfolio to a small parallel shift in the zero-coupon yield 
curve. The relationship is 


AP = —PDAy 


where P is the value of the portfolio, D is the duration of the portfolio, Ay is the size of 
a small parallel shift in the zero curve, and AP is the resultant effect on the value of the 
portfolio. A more precise relationship is 


AP = —PDAy + 5PC(ay)? 


where C is the convexity of the portfolio. This relationship is accurate for relatively large 
parallel shifts in the yield curve but does not quantify the exposure to nonparallel shifts. 
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To quantify exposure to all the different ways the yield curve can change through 
time, several duration or delta measures are necessary. There are a number of ways these 
can be defined. A principal components analysis can be a useful alternative to calculating 
multiple deltas. It shows that the yield curve shifts that occur in practice are, to a large 
extent, a linear sum of two or three standard shifts. If a portfolio manager is hedged 
against these standard shifts, he or she is therefore also well hedged against the shifts that 
occur in practice. 
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Practice Questions and Problems 
(Answers at End of Book) 


9.1 Suppose that a bank has $5 billion of one-year loans and $20 billion of five-year 
loans. These are financed by $15 billion of one-year deposits and $10 billion of 
five-year deposits. Explain the impact on the bank’s net interest income of interest 
rates increasing by 1% every year for the next three years. 

9.2 Explain why long-term rates are higher than short-term rates most of the time. 
Under what circumstances would you expect long-term rates to be lower than 
short-term rates? 

9.3 Why are U.S. Treasury rates significantly lower than other rates that are close to 
risk free? 

9.4 Explain how an overnight indexed swap works. 

9.5 Explain why the LIBOR-OIS spread is a measure of stress in financial markets. 

9.6 What does duration tell you about the sensitivity of a bond portfolio to interest 
rates? What are the limitations of the duration measure? 

9.7 A five-year bond with a yield of 11% (continuously compounded) pays an 8% 
coupon at the end of each year. 

(a) What is the bond’s price? 

(b) What is the bond’s duration? 

(c) Use the duration to calculate the effect on the bond’s price of a 0.2% decrease 
in its yield. 

(d) Recalculate the bond’s price on the basis of a 10.8% per annum yield and verify 
that the result is in agreement with your answer to (c). 
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9.8 


9.9 


9.10 


9.11 


9.12 


9.13 
9.14 


9.15 


9.16 


9.17 


9.18 


9.19 


Repeat Problem 9.7 on the assumption that the yield is compounded annually. Use 
modified durations. 

A six-year bond with a continuously compounded yield of 4% provides a 5% 
coupon at the end of each year. Use duration and convexity to estimate the effect of 
a 1% increase in the yield on the price of the bond. How accurate is the estimate? 
Explain three ways in which multiple deltas can be calculated to manage nonparallel 
yield curve shifts. 

Consider a $1 million portfolio with the partial durations in Table 9.5. Estimate 
delta with respect to the first two factors in Table 9.6. 

Use the partial durations in Table 9.5 to calculate the impact of a shift in the yield 
curve on a $10 million portfolio where the 1-, 2-, 3-, 4-,5-, 7-, and 10-year rates 
increase by 10, 8, 7, 6,5, 3, and 1 basis points, respectively. 

How are “dollar duration” and “dollar convexity” defined? 

What is the relationship between (a) the duration, (b) the partial durations, and (c) 
the DV01 of a portfolio? 


Further Questions 


Prove (a) that the definitions of duration in equations (9.1) and (9.3) are the same 

when y is continuously compounded and (b) that when y is compounded m times 

per year they are the same if the right-hand side of equation (9.3) is divided by 

1+ y/m. 

Suppose that a bank has $10 billion of one-year loans and $30 billion of five-year 

loans. These are financed by $35 billion of one-year deposits and $5 billion of five- 

year deposits. The bank has equity totaling $2 billion and its return on equity is 

currently 12%. Estimate what change in interest rates next year would lead to the 

bank’s return on equity being reduced to zero. Assume that the bank is subject to 

a tax rate of 30%. 

Portfolio A consists of a one-year zero-coupon bond with a face value of $2,000 

and a 10-year zero-coupon bond with a face value of $6,000. Portfolio B consists 

of a 5.95-year zero-coupon bond with a face value of $5,000. The current yield 

on all bonds is 10% per annum (continuously compounded). 

(a) Show that both portfolios have the same duration. 

(b) Show that the percentage changes in the values of the two portfolios for a 0.1% 
per annum increase in yields are the same. 

(c) What are the percentage changes in the values of the two portfolios for a 5% 
per annum increase in yields? 

What are the convexities of the portfolios in Problem 9.17? To what extent do (a) 

duration and (b) convexity explain the difference between the percentage changes 

calculated in part (c) of Problem 9.17? 

When the partial durations are as in Table 9.5, estimate the effect of a shift in the 

yield curve where the ten-year rate stays the same, the one-year rate moves up by 9e, 
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and the movements in intermediate rates are calculated by interpolation between 
9e and 0. How could your answer be calculated from the results for the rotation 
calculated in Section 9.6? 

9.20 Suppose that the change in a portfolio value for a one-basis-point shift in the 1-, 
2-, 3-, 4-, 5-, 7-, 10-, and 30-year rates are (in $ millions) +5, —3, —1, +2, +5, +7, 
+8, and +1, respectively. Estimate the delta of the portfolio with respect to the first 
three factors in Table 9.6. Quantify the relative importance of the three factors for 
this portfolio. 


Chapter 10 


Volatility 


t is important for a financial institution to monitor the volatilities of the market 
[oss (interest rates, exchange rates, equity prices, commodity prices, etc.) on 

which the value of its portfolio depends. This chapter describes the procedures it 
can use to do this. 

The chapter starts by explaining how volatility is defined. It then examines the com- 
mon assumption that percentage returns from market variables are normally distributed 
and presents the power law as an alternative. After that it moves on to consider models 
with imposing names such as exponentially weighted moving average (EWMA), autore- 
gressive conditional heteroscedasticity (ARCH), and generalized autoregressive condi- 
tional heteroscedasticity (GARCH). The distinctive feature of these models is that they 
recognize that volatility is not constant. During some periods, volatility is relatively low, 
while during other periods it is relatively high. The models attempt to keep track of 
variations in volatility through time. 


10.1 Definition of Volatility 


A variable’s volatility, ©, is defined as the standard deviation of the return provided by the 
variable per unit of time when the return is expressed using continuous compounding. 
(See Appendix A for a discussion of compounding frequencies.) When volatility is used 
for option pricing, the unit of time is usually one year, so that volatility is the standard 
deviation of the continuously compounded return per year. When volatility is used for 
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risk management, the unit of time is usually one day so that volatility is the standard 
deviation of the continuously compounded return per day. 

Define S, as the value ofa variable at the end of day i. The continuously compounded 
return per day for the variable on day i is 


This is almost exactly the same as 


An alternative definition of daily volatility of a variable is therefore the standard deviation 
of the proportional change in the variable during a day. This is the definition that is usually 
used in risk management. 


Example 10.1 


Suppose that an asset price is $60 and that its daily volatility is 2%. This means that a one- 
standard-deviation move in the asset price over one day would be 60 x 0.02 or $1.20. 
If we assume that the change in the asset price is normally distributed, we can be 95% 
certain that the asset price will be between 60 — 1.96 x 1.2 = $57.65 and 60 + 1.96 x 
1.2 = $62.35 at the end of the day. (This is what is referred to as a two-tailed test, where 
there is a 2.5% probability in each of the upper and lower tails of the distribution.) 


If we assume that the returns each day are independent with the same variance, the 
variance of the return over T days is T times the variance of the return over one day. This 
means that the standard deviation of the return over T days is VT times the standard 
deviation of the return over one day. This is consistent with the adage “uncertainty 
increases with the square root of time.” 


Example 10.2 


Assume as in Example 10.1 that an asset price is $60 and the volatility per day is 2%. The 


standard deviation of the continuously compounded return over five days is V5 X 2 or 
4.47%. Because five days is a short period of time, this can be assumed to be the same 
as the standard deviation of the proportional change over five days. A one-standard- 
deviation move would be 60 X 0.0447 = 2.68. If we assume that the change in the asset 
price is normally distributed, we can be 95% certain that the asset price will be between 
60 — 1.96 X 2.68 = $54.74 and 60 + 1.96 x 2.68 = $65.26 at the end of the five days. 
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10.1.1 Variance Rate 


Risk managers often focus on the variance rate rather than the volatility. The variance 
rate is defined as the square of the volatility. The variance rate per day is the variance of 
the return in one day. Whereas the standard deviation of the return in time T increases 
with the square root of time, the variance of this return increases linearly with time. If 
we wanted to be pedantic, we could say that it is correct to talk about the variance rate 
per day, but volatility is per square root of day. 


10.1.2 Business Days vs. Calendar Days 


One issue is whether time should be measured in calendar days or business days. As 
shown in Business Snapshot 10.1, research shows that volatility is much higher on 
business days than on non-business days. As a result, analysts tend to ignore weekends 
and holidays when calculating and using volatilities. The usual assumption is that there 
are 252 days per year. 

Assuming that the returns on successive days are independent and have the same 
standard deviation, this means that 


Oy, = Oday V 252 


or 


Oyr 


Oy = — 
S 4/252 


showing that the daily volatility is about 6% of annual volatility. 


10.2 Implied Volatilities 


Although risk managers usually calculate volatilities from historical data, they also try 
and keep track of what are known as implied volatilities. The one parameter in the Black- 
Scholes—Merton option pricing model that cannot be observed directly is the volatility 
of the underlying asset price (see Appendix E). The implied volatility of an option is the 
volatility that gives the market price of the option when it is substituted into the pricing 
model. 


10.2.1 The VIX Index 


The CBOE publishes indices of implied volatility. The most popular index, the VIX, is 
an index of the implied volatility of 30-day options on the S&P 500 calculated from a 
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BUSINESS SNAPSHOT 10.1 
What Causes Volatility? 


It is natural to assume that the volatility of a stock or other asset is caused by new 
information reaching the market. This new information causes people to revise 
their opinions about the value of the asset. The price of the asset changes and 
volatility results. However, this view of what causes volatility is not supported by 
research. 

With several years of daily data on an asset price, researchers can calculate: 


1. The variance of the asset’s returns between the close of trading on one day 
and the close of trading on the next day when there are no intervening 
nontrading days. 

2. The variance of the asset’s return between the close of trading on Friday and 
the close of trading on Monday. 


The second is the variance of returns over a three-day period. The first is a 
variance over a one-day period. We might reasonably expect the second variance 
to be three times as great as the first variance. Fama (1965), French (1980), and 
French and Roll (1986) show that this is not the case. For the assets considered, 
the three research studies estimate the second variance to be 22%, 19%, and 
10.7% higher than the first variance, respectively. 

At this stage you might be tempted to argue that these results are explained by 
more news reaching the market when the market 1s open for trading. But research 
by Roll (1984) does not support this explanation. Roll looked at the prices of 
orange juice futures. By far the most important news for orange juice futures is 
news about the weather, and this is equally likely to arrive at any time. When Roll 
compared the two variances for orange juice futures, he found that the second 
(Friday-to-Monday) variance is only 1.54 times the first (one-day) variance. 

The only reasonable conclusion from all this is that volatility is, to a large 
extent, caused by trading itself. (Traders usually have no difficulty accepting this 
conclusion!) 


wide range of calls and puts.! Trading in futures on the VIX started in 2004 and trading 
in options on the VIX started in 2006. A trade involving options on the S&P 500 is a bet 
on both the future level of the S&P 500 and the volatility of the S&P 500. By contrast, 
a futures or options contract on the VIX is a bet only on volatility. One contract is on 
1,000 times the index. 


' Similarly, the VXN is an index of the volatility of the NASDAQ 100 index and the VXD is an index 
of the volatility of the Dow Jones Industrial Average. 
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Figure 10.1 The VIX Index, January 2004 to June 2017 


Example 10.3 


Suppose that a trader buys an April futures contract on the VIX when the futures price is 
$18.5 (corresponding to a 30-day S&P 500 volatility of 18.5%) and closes out the contract 
when the futures price is $19.3 (corresponding to an S&P 500 volatility of 19.3%). The 
trader makes a gain of $800. 


Figure 10.1 shows the VIX index between January 2004 and June 2017. Between 
2004 and mid-2007, it tended to stay between 10 and 20. It reached 30 during the second 
half of 2007 and a record 80 in October and November 2008 after the failure of Lehman 
Brothers. By early 2010, it had returned to more normal levels, but it spiked again later 
because of market uncertainties. Sometimes market participants refer to the VIX index 
as the fear index. 


10.3 Are Daily Percentage Changes in 
Financial Variables Normal? 


When confidence limits for the change in a market variable are calculated from its 
volatility, a common assumption is that the change is normally distributed. This is the 
assumption we made in Examples 10.1 and 10.2. In practice, most financial variables 
are more likely to experience big moves than the normal distribution would suggest. 
Table 10.1 shows the results of a test of normality using daily movements in 10 differ- 
ent exchange rates over a 10-year period between 2005 and 2015. The exchange rates 
are those between the U.S. dollar and the following currencies: Australian dollar, British 
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Table 10.1 Percentage of Days When Absolute Size of Daily 
Exchange Rate Moves Is Greater Than One, Two,..., Six Standard 
Deviations (S.D. = standard deviation of percentage daily change) 


Real Normal 
World (%) Model (%) 
>1S.D. 23.32 31.73 
> 2 S.D. 4.67 4.55 
> 3 S.D. 1.30 0.27 
> 4S.D. 0.49 0.01 
>5S.D. 0.24 0.00 
> 6S.D. 0.13 0.00 


pound, Canadian dollar, Danish krone, euro, Japanese yen, Mexican peso, New Zealand 
dollar, Swedish krona, and Swiss franc. The first step in the production of the table is to 
calculate the standard deviation of daily percentage changes in each exchange rate. The 
next stage is to note how often the actual percentage changes exceeded one standard 
deviation, two standard deviations, and so on. These numbers are then compared with 
the corresponding numbers for the normal distribution. 

Daily percentage changes exceed three standard deviations on 1.30% of the days. The 
normal model for returns predicts that this should happen on only 0.27% of days. Daily 
percentage changes exceed four, five, and six standard deviations on 0.49%, 0.24%, and 
0.13% of the days, respectively. The normal model predicts that we should hardly ever 
observe this happening. The table, therefore, provides evidence to support the existence 
of the fact that the probability distributions of changes in exchange rates have heavier 
tails than the normal distribution. 

When returns are continuously compounded, the return over many days is the sum 
of the returns over each of the days. If the probability distribution of the return in a day 
were the same non-normal distribution each day, the central limit theorem of statistics 
would lead to the conclusion that the return over many days is normally distributed. In 
fact, the returns on successive days are not identically distributed. (One reason for this, 
which will be discussed later in this chapter, is that volatility is not constant.) As a result, 
heavy tails are observed in the returns over relatively long periods as well as in the returns 
observed over one day. Business Snapshot 10.2 shows how you could have made money 
if you had done an analysis similar to that in Table 10.1 in 1985! 

Figure 10.2 compares a typical heavy-tailed distribution (such as the one for an 
exchange rate) with a normal distribution that has the same mean and standard devi- 
ation” The heavy-tailed distribution is more peaked than the normal distribution. In 
Figure 10.2, we can distinguish three parts of the distribution: the middle, the tails, and 


? Kurtosis measures the size of a distribution’s tails. A leptokurtic distribution has heavier tails than the nor- 
mal distribution. A platykurtic distribution has less heavy tails than the normal distribution; a distribution 
with the same-size tails as the normal distribution is termed mesokurtic. 
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BUSINESS SNAPSHOT 10.2 
Making Money from Foreign Currency Options 


Black, Scholes, and Merton in their option pricing model assume that the under- 
lying asset’s price has a lognormal distribution at a future time. This is equivalent 
to the assumption that asset price changes over short periods, such as one day, 
are normally distributed. Suppose that most market participants are comfortable 
with the assumptions made by Black, Scholes, and Merton. You have just done 
the analysis in Table 10.1 and know that the normal/lognormal assumption is 
not a good one for exchange rates. What should you do? 

The answer is that you should buy deep-out-of-the-money call and put 
options on a variety of different currencies—and wait. These options will be 
relatively inexpensive and more of them will close in-the-money than the Black— 
Scholes-Merton model predicts. The present value of your payoffs will on aver- 
age be much greater than the cost of the options. 

In the mid-1980s, a few traders knew about the heavy tails of foreign 
exchange probability distributions. Everyone else thought that the lognormal 
assumption of the Black—Scholes—Merton model was reasonable. The few traders 
who were well informed followed the strategy we have described—and made 
lots of money. By the late 1980s, most traders understood the heavy tails and the 
trading opportunities had disappeared. 


~<— Heavy-tailed 


\<— Normal 
\ 


Figure 10.2 Comparison of a Normal Distribution with a Heavy-Tailed Distribution 
The two distributions have the same mean and standard deviation. 


220 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


the intermediate parts (between the middle and the tails). When we move from the nor- 
mal distribution to the heavy-tailed distribution, probability mass shifts from the inter- 
mediate parts of the distribution to both the tails and the middle. If we are considering 
the percentage change in a market variable, the heavy-tailed distribution has the prop- 
erty that small and large changes in the variable are more likely than they would be if a 
normal distribution were assumed. Intermediate changes are less likely. 


10.4 The Power Law 


The power law provides an alternative to assuming normal distributions. The law asserts 
that, for many variables that are encountered in practice, it is approximately true that the 
value of the variable, v, has the property that when x is large 


Prob(v > x) = Kx“ (10.1) 


where K and a are constants. The equation has been found to be approximately true for 
variables v as diverse as the income of an individual, the size of a city, and the number of 
visits to a website in a day. 


Example 10.4 


Suppose that we know from experience that a = 3 for a particular financial variable and 
we observe that the probability that v > 10 is 0.05. Equation (10.1) gives 


0.05 = Kx 107° 
so that K = 50. The probability that v > 20 can now be calculated as 
50 x 207° = 0.00625 
The probability that v > 30 is 
50 x 307° = 0.0019 


and so on. 


Equation (10.1) implies that 


In[Prob(v > x)] = In K — a ln x 
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Table 10.2 Values Calculated from Table 10.1 


x In(x) Prob(v > x) In[Prob(v > x)] 
1 0.000 0.2332 —1.4560 
2 0.693 0.0467 —3.0634 
3 1.099 0.0130 —4.3421 
4 1.386 0.0049 —5.3168 
5 1.609 0.0024 —6.0182 
6 1.792 0.0013 —6.6325 


We can therefore do a quick test of whether it holds by plotting In[Prob(v > x)] against 
In(x). In order to do this for the data in Table 10.1, define v as the number of standard 
deviations by which an exchange rate changes in one day. 

The values of In(x) and In[Prob(v > x)] are calculated in Table 10.2. The data in 
Table 10.2 are plotted in Figure 10.3. This shows that the logarithm of the probability of 
the exchange rate changing by more than x standard deviations is approximately linearly 
dependent on In(x) for x > 2. This is evidence that the power law holds for these data. 
Using data for x = 3, 4,5, and 6, a regression analysis gives the best-fit relationship as 


In[Prob(v > x)] = —0.735 — 3.291 ln(x) 


showing that estimates for K and « are as follows: K = e707 = 0.479 and a = 3.291. 
An estimate for the probability of a change greater than 4.5 standard deviations is 


0.479 x 4.5°-°7°! = 0.00340 
An estimate for the probability of a change greater than seven standard deviations is 


0.479 x 7°?! = 0.000794 
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Figure 10.3 Log-Log Plot for Probability That Exchange Rate Moves by More Than a Certain 
Number of Standard Deviations 
v is the exchange rate change measured in standard deviations. 
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We examine the power law more formally and explain better procedures for estimating 
the parameters when we consider extreme value theory in Chapter 13. We also consider 
its use in the assessment of operational risk in Chapter 23. 


10.5 Monitoring Daily Volatility 


Define ©, as the volatility per day of a market variable on day n, as estimated at the end 
of day n — 1. The variance rate, which, as mentioned earlier, is defined as the square of the 
volatility, is oO. Suppose that the value of the market variable at the end of day i is S,. 
Define u; as the continuously compounded return during day i (between the end of day 
i— 1 and the end of day i) so that 


One approach to estimating ©, is to set it equal to the standard deviation of the u;’s. 
When the most recent m observations on the u; are used in conjunction with the usual 
formula for standard deviation, this approach gives: 


m 


o = — ) (u,-;- 7)” (10.2) 


where u is the mean of the u,’3: 


Example 10.5 


Table 10.3 shows a possible sequence of stock prices. Suppose that we are interested in 
estimating the volatility for day 21 using 20 observations on the u; so that n = 21 and 
m = 20. In this case, u = 0.00074 and the estimate of the standard deviation of the daily 
return calculated using equation (10.2) is 1.49%. 


For risk management purposes, the formula in equation (10.2) is usually changed in 
a number of ways: 


1. As indicated in Section 10.1, u; is defined as the percentage change in the market 
variable between the end of day i — 1 and the end of day i so that 
S; — Si- 


10.3 
ae (10.3) 


This makes very little difference to the values of u; that are computed. 
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Table 10.3 Data for Computation of Volatility 


Closing Stock Price Relative Daily Return 
Day Price (dollars) S;/S;-4 u; = In(S;/S,_,) 
0 20.00 
1 20.10 1.00500 0.00499 
2 19.90 0.99005 —0.01000 
3 20.00 1.00503 0.00501 
4 20.50 1.02500 0.02469 
5 20.25 0.98780 —0.01227 
6 20.90 1.03210 0.03159 
7 20.90 1.00000 0.00000 
8 20.90 1.00000 0.00000 
g 20.60 0.98565 —0.01446 
10 20.50 0.99515 —0.00487 
11 21.00 1.02439 0.02410 
12 21.10 1.00476 0.00475 
13 20.70 0.98104 —0.01914 
14 20.50 0.99034 —0.00971 
15 20.70 1.00976 0.00971 
16 20.90 1.00966 0.00962 
17 20.40 0.97608 —0.02421 
18 20.50 1.00490 0.00489 
19 20.60 1.00488 0.00487 
20 20.30 0.98544 —0.01467 


2. u is assumed to be zero. The justification for this is that the expected change in 
a variable in one day is very small when compared with the standard deviation of 
changes? 

3. m— 1 is replaced by m. This moves us from an unbiased estimate of the volatility to 
a maximum likelihood estimate, as we explain in Section 10.9. 


These three changes allow the formula for the variance rate to be simplified to 


> 1 m > 
gp =s u“. 10.4 
a= De (10.4) 


where u; is given by equation (10.3). 


3 This is likely to be the case even if the variable happened to increase or decrease quite fast during 
the m days of our data. 
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Example 10.6 
Consider again Example 10.5. When n = 21 and m = 20, 


>) _, = 0.00424 
i=1 


so that equation (10.4) gives 


o? = 0.00424/20 = 0.000214 


and o, = 0.014618 or 1.46%. This is only a little different from the result in 
Example 10.5. 


10.5.1 Weighting Schemes 


Equation (10.4) gives equal weight to each of u7_,, u? e 


n-1? “n-2?"* nm" 
to estimate o,, the volatility on day n. It therefore makes sense to give more weight to 


recent data. A model that does this is 


., and u Our objective is 


m 


o = > Ou, (10.5) 
i=1 


The variable a; is the amount of weight given to the observation i days ago. The o's 
are positive. If we choose them so that a; < a; when i > j, less weight is given to older 
observations. The weights must sum to unity, so that 


m 


> a; =1 
i=1 


An extension of the idea in equation (10.5) is to assume that there is a long-run 
average variance rate and that this should be given some weight. This leads to the model 
that takes the form 


m 


o2 = yV, + Ya, (10.6) 


i=1 


where V; is the long-run variance rate and y is the weight assigned to V,. Because the 
weights must sum to unity, we have 


m 


y+ Yo;=1 
i=1 
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This is known as an ARCH(m) model. It was first suggested by Engle.* The estimate of 
the variance is based on a long-run average variance and m observations. The older an 
observation, the less weight it is given. Defining @ = y V}, the model in equation (10.6) 
can be written 


m 


oo =at+ » On. (10.7) 
i=1 


In the next two sections, we discuss two important approaches to monitoring volatil- 
ity using the ideas in equations (10.5) and (10.6). 


10.6 The Exponentially Weighted Moving Average Model 


The exponentially weighted moving average (EWMA) model is a particular case of the 
model in equation (10.5) where the weights, «;, decrease exponentially as we move back 
through time. Specifically, w,,, = Aa; where A is a constant between zero and one. 

It turns out that this weighting scheme leads to a particularly simple formula for 
updating volatility estimates. The formula is 


Oo =)o?_, +(1-Ayw 


n r n—1 


(10.8) 


The estimate, ©,„, of the volatility for day n (made at the end of day n — 1) is calculated 
from 6,_, (the estimate that was made at the end of day n — 2 of the volatility for day 


n? 


n— 1) and u,_, (the most recent daily percentage change). 
To understand why equation (10.8) corresponds to weights that decrease exponen- 
tially, we substitute for o to get 


o = MAc? + (1 = Au a] + (1A 


or 


o = (1-H) HAO, 


n—1 
Substituting in a similar way for o, gives 


2 2 2 2,2 Den 
o = (1-A)u_, +u a HAU a) +o, 


+See R. FE Engle, “Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of 
U.K. Inflation,” Econometrica 50 (1982): 987-1008. Robert Engle won the Nobel Prize in Economics 
in 2003 for his work on ARCH models. 
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Continuing in this way, we see that 


For a large m, the term No? is sufficiently small to be ignored so that equation (10.8) 
is the same as equation (10.5) with a, = (1 —A)Ai!. The weights for the u; decline at 
rate A as we move back through time. Each weight is À times the previous weight. 


Example 10.7 


Suppose that A is 0.90, the volatility estimated for a market variable for day n— 1 is 
1% per day, and during day n — 1 the market variable increased by 2%. This means that 
6, = 0.017 = 0.0001 and u-_, = 0.02? = 0.0004. Equation (10.8) gives 


o? = 0.9 X 0.0001 + 0.1 X 0.0004 = 0.00013 


The estimate of the volatility for day n, © „is, therefore, y 0.00013 or 1.14% per day. Note 


that the expected value of AA is o or 0.0001. In this example, the realized value of 
AA is greater than the expected value, and as a result our volatility estimate increases. 


If the realized value of AA had been less than its expected value, our estimate of the 


volatility would have decreased. 


n? 


The EWMA approach has the attractive feature that the data storage requirements 
are modest. At any given time, we need to remember only the current estimate of the 
variance rate and the most recent observation on the value of the market variable. When 
we get a new observation on the value of the market variable, we calculate a new daily 
percentage change and use equation (10.8) to update our estimate of the variance rate. 
The old estimate of the variance rate and the old value of the market variable can then 
be discarded. 

The EWMA approach is designed to track changes in the volatility. Suppose there is 
a big move in the market variable on day n — 1 so that w is large. From equation (10.8) 
this causes our estimate of the current volatility to move upward. The value of A governs 
how responsive the estimate of the daily volatility is to the most recent daily percentage 
change. A low value of A leads to a great deal of weight being given to the ae when ©, 
is calculated. In this case, the estimates produced for the volatility on successive days are 
themselves highly volatile. A high value of A (i.e., a value close to 1.0) produces estimates 
of the daily volatility that respond relatively slowly to new information provided by the 
daily percentage change. 

The RiskMetrics database, which was originally created by JPMorgan and made 
publicly available in 1994, used the EWMA model with à = 0.94 for updating daily 
volatility estimates. The company found that, across a range of different market variables, 
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this value of À gives forecasts of the variance rate that come closest to the realized variance 
rate.” In 2006, RiskMetrics switched to using a long memory model. This is a model where 
the weights assigned to the w as i increases decline less fast than in EWMA. 


10.7 The GARCH(1,1) Model 


We now move on to discuss what is known as the GARCH(1,1) model, proposed by 
Bollerslev in 1986.° The difference between the EWMA model and the GARCH(1,1) 
model is analogous to the difference between equation (10.5) and equation (10.6). In 
GARCH(1,1), o? is calculated from a long-run average variance rate, V,, as well as from 
6,,-; and u,_,. The equation for GARCH(1,1) is 


+ Bor (10.9) 


2 2 
0; T YVL i Ou, n-1 


1 
where y is the weight assigned to V}, & is the weight assigned to ae and B is the weight 
assigned to o . Because the weights must sum to one: 


y+a+ß=1 


The EWMA model is a particular case of GARCH(1,1) where y = 0, a = 1 — À, and 
B=A. 

The “(1,1)”in GARCH (1,1) indicates that o? is based on the most recent observation 
of u? and the most recent estimate of the variance rate. The more general GARCH(p, 
q) model calculates o? from the most recent p observations on u? and the most recent q 
estimates of the variance rate” GARCH(1,1) is by far the most popular of the GARCH 
models. 

Setting @ = y V}, the GARCH(1,1) model can also be written 


+ Bo (10.10) 


n-1 


2 
Oo =0+ our 
n n-1 


>See JPMorgan, RiskMetrics Monitor, Fourth Quarter 1995. We will explain an alternative (maximum 
likelihood) approach to estimating parameters later in the chapter. The realized variance rate on a 
particular day was calculated as an equally weighted average of the we on the subsequent 25 days. (See 
Problem 10.20.) 

é See T. Bollerslev, “Generalized Autoregressive Conditional Heteroscedasticity,” Journal of Econometrics 
31 (1986): 307-327. 

7 Other GARCH models have been proposed that incorporate asymmetric news. These models are 
designed so that 6, depends on the sign of u,_,. Arguably, the models are more appropriate than 
GARCH(1,1) for equities. This is because the volatility of an equity’s price tends to be inversely related 
to the price so that a negative u,_, should have a bigger effect on o, than the same positive u,,_,. For 
a discussion of models for handling asymmetric news, see D. Nelson, “Conditional Heteroscedasticity 
and Asset Returns: A New Approach,” Econometrica 59 (1990): 347-370, and R. E Engle and V. Ng, 
“Measuring and Testing the Impact of News on Volatility,’ Journal of Finance 48 (1993): 1749-1778. 
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This is the form of the model that is usually used for the purposes of estimating the 
parameters. Once @, a, and B have been estimated, we can calculate y as 1 — a — p. 
The long-term variance V, can then be calculated as w/y. For a stable GARCH(1,1) 
model, we require a + P < 1. Otherwise the weight applied to the long-term variance is 
negative. 


Example 10.8 
Suppose that a GARCH(1,1) model is estimated from daily data as 


o? = 0.000002 + 0.13u°_, + 0.8607_, 
This corresponds to aœ = 0.13, B = 0.86, and œ = 0.000002. Because y = 1 — a — P, it 
follows that y = 0.01 and because œ = yV_, it follows that V, = 0.0002. In other words, 
the long-run average variance per day implied by the model is 0.0002. This corresponds 
to a volatility of 170.0002 = 0.014 or 1.4% per day. 
Suppose that the estimate of the volatility on day n — 1 is 1.6% per day so that o | = 
0.0167 = 0.000256, and that on day n — 1 the market variable decreased by 1% so that 
u-_, = 0.017 = 0.0001. Then: 


o? = 0.000002 + 0.13 x 0.0001 + 0.86 x 0.000256 = 0.00023516 


The new estimate of the volatility is, therefore, y 0.00023516 = 0.0153 or 1.53% per 
day. 


10.7.1 The Weights 


Substituting for o? in equation (10.10) we obtain 


1 
2 2 2 2 
o = œ + au; + Po + aus + Bo?) 


or 


2 


Ei 2 
oO =o+r Bo + au + apus, 


22 
+ p S19 


Substituting for o7_, we get 
we 


2 


2 


o? = œ + bo + P'o + au? | + apu? 


n—2 


Did: Dd 
+ ap Ming + O3 


Continuing in this way, we see that the weight applied to TA is apit. The weights 
decline exponentially at rate B. The parameter B can be interpreted as a decay rate. It is 
similar to A in the EWMA model. It defines the relative importance of the observations 
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on the u; in determining the current variance rate. For example, if B = 0.9, u?_, is only 
90% as important as u; A and so on. The GARCH(1,1) 


model is the same as the EWMA model except that, in addition to assigning weights 


is 81% as important as u7_,; 
n—1 
. . 2 . . . 
that decline exponentially to past u;, it also assigns some weight to the long-run average 
variance rate. 


10.8 Choosing Between the Models 


In practice, variance rates do tend to be pulled back to a long-run average level. This is the 
mean reversion phenomenon discussed in Section 7.5. The GARCH(1,1) model incor- 
porates mean reversion whereas the EWMA model does not. GAR CH (1,1) is, therefore, 
theoretically more appealing than the EWMA model. 

In the next section, we will discuss how best-fit parameters œ, a, and B in 
GARCH(1,1) can be estimated. When the parameter @ is zero, the GARCH(1,1) reduces 
to EWMaA. In circumstances where the best-fit value of @ turns out to be negative, the 
GARCH(1,1) model is not stable and it makes sense to switch to the EWMA model. 


10.9 Maximum Likelihood Methods 


It is now appropriate to discuss how the parameters in the models we have been con- 
sidering are estimated from historical data. The approach used is known as the maximum 
likelihood method. It involves choosing values for the parameters that maximize the chance 
(or likelihood) of the data occurring. 

We start with a very simple example. Suppose that we sample 10 stocks at random 
on a certain day and find that the price of one of them declined during the day and the 
prices of the other nine either remained the same or increased. What is our best estimate 
of the proportion of stock prices that declined during the day? The natural answer is 0.1. 
Let us see if this is the result given by maximum likelihood methods. 

Suppose that the probability ofa price decline is p. The probability that one particu- 
lar stock declines in price and the other nine do not is p(1 — p)’. (There is a probability p 
that it will decline and 1 — p that each of the other nine will not.) Using the maximum 
likelihood approach, the best estimate of p is the one that maximizes p(1 — p)’. Differ- 
entiating this expression with respect to p and setting the result equal to zero, it can be 
shown that p = 0.1 maximizes the expression. The maximum likelihood estimate of p is 
therefore 0.1, as expected. 


10.9.1 Estimating a Constant Variance 


In our next example of maximum likelihood methods, we consider the problem of esti- 
mating a variance of a variable X from m observations on X when the underlying dis- 
tribution is normal. We assume that the observations are u4, U,..., u,, and that the mean 
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of the underlying normal distribution is zero. Denote the variance by v. The likelihood 
of u; being observed is the probability density function for X when X = u;. This is 


The likelihood of m observations occurring in the order in which they are observed is 


i —u? 
II as (£) (10.11) 
i=1 | V 2nv 2v 


Using the maximum likelihood method, the best estimate of v is the value that maximizes 


this expression. 

Maximizing an expression is equivalent to maximizing the logarithm of the expres- 
sion. Taking logarithms of the expression in equation (10.11) and ignoring constant 
multiplicative factors, it can be seen that we wish to maximize 


m 2 
2, - In(v) — a (10.12) 


or 


Differentiating this expression with respect to v and setting the result equation to zero, 
it can be shown that the maximum likelihood estimator of v is 


m 


1 
Se 
m a 


This maximum likelihood estimator is the one we used in equation (10.4). The corre- 
sponding unbiased estimator is the same with m replaced by m — 1. 


10.9.2 Estimating EWMA or GARCKH(1,1) 


We now consider how the maximum likelihood method can be used to estimate the 
parameters when EWMA, GARCH(1,1), or some other volatility updating procedure 
is used. Define v; = o? as the variance estimated for day i. Assume that the probability 
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Table 10.4 Estimation of Parameters in GARCH(1,1) Model for S&P 500 between July 18, 2005, 
and August 13, 2010 


Date Day i S; u; v = 07 — In(v;) — w/v; 
18-Jul-2005 1 1221.13 
19-Jul-2005 2 1229.35 0.006731 
20-Jul-2005 3 1235.20 0.004759 0.00004531 9.5022 
21-Jul-2005 4 1227.04 —0.006606 0.00004447 9.0393 
22-Jul-2005 5 1233.68 0.005411 0.00004546 9.3545 
25-Jul-2005 6 1229.03 —0.003769 0.00004517 9.6906 
11-Aug-2010 1277 1089.47 —0.028179 0.00011834 2.3322 
12-Aug-2010 1278 1083.61 —0.005379 0.00017527 8.4841 
13-Aug-2010 1279 1079.25 —0.004024 0.00016327 8.6209 
10,228.2349 

Trial Estimates of GARCH parameters 

@ a p 
0.000001347 0.08339 0.9101 


distribution of u; conditional on the variance is normal. A similar analysis to the one just 
given shows the best parameters are the ones that maximize 


m 1 =u? 
wea 
i=1 2m; 2v; 


Taking logarithms, we see that this is equivalent to maximizing 


m u2 


$ |-inw) - + (10.13) 


i=1 1 


This is the same as the expression in equation (10.12), except that v is replaced by v;. It 
is necessary to search iteratively to find the parameters in the model that maximize the 
expression in equation (10.13). 

The spreadsheet in Table 10.4 indicates how the calculations could be organized for 
the GARCH(1,1) model. The table analyzes data on the S&P 500 between July 18,2005, 
and August 13, 20108 

The numbers in the table are based on trial estimates of the three GARCH(1,1) 
parameters: @,&,and P. The first column in the table records the date. The second column 
counts the days. The third column shows the S&P 500 at the end of day i, S;. The fourth 
column shows the proportional change in the S&P 500 between the end of day i — 1 
and the end of day i. This is u; = (S; — S,_,)/S,_,. The fifth column shows the estimate 


8 The data and calculations can be found at www-2.rotman.utoronto.ca/~hull/riskman. 
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Figure 10.4 S&P 500 Index: July 18, 2005, to August 13, 2010 


of the variance rate, v; = O°, for day i made at the end of day i— 1. On day three, we 
start things off by setting the variance equal to u2. On subsequent days, equation (10.10) 
is used. The sixth column tabulates the likelihood measure, — In(v;) — u? /v;. The values 
in the fifth and sixth columns are based on the current trial estimates of œ, a, and B. We 
are interested in choosing @, &, and B to maximize the sum of the numbers in the sixth 
column. This involves an iterative search procedure? 

In our example, the optimal values of the parameters turn out to be 


@ = 0.0000013465 
a = 0.083394 
B = 0.910116 


and the maximum value of the function in equation (10.13) is 10,228.2349. (The num- 
bers shown in Table 10.4 are actually those calculated on the final iteration of the search 
for the optimal o, a, and B.) 

The long-term variance rate, V,, in our example is 


(O _ 0.0000013465 


— = = 0.0002075 
1-a-ß 0.006490 


The long-term volatility is y 00.0002075 or 1.4404% per day. 
Figures 10.4 and 10.5 show the S&P 500 index and the GARCH(1,1) volatility 
during the five-year period covered by the data. Most of the time, the volatility was less 


? As discussed later, a general-purpose algorithm such as Solver in Microsoft’s Excel can be used. 
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Figure 10.5 GARCH(1,1) Daily Volatility of S&P 500 Index: July 18, 2005, to August 13, 2010 


than 2% per day, but volatilities over 5% per day were experienced during the credit 
crisis. (Very high volatilities are also indicated by the VIX index—-see Figure 10.1.) 

An alternative, more robust approach to estimating parameters in GARCH(1,1) is 
known as variance targeting.!° This involves setting the long-run average variance rate, 
V,, equal to the sample variance calculated from the data (or to some other value that is 
believed to be reasonable). The value of @ then equals V; (1 — a — p) and only two param- 
eters have to be estimated. For the data in Table 10.4, the sample variance is 0.0002412, 
which gives a daily volatility of 1.5531%. Setting V, equal to the sample variance, the val- 
ues of a and f that maximize the objective function in equation (10.13) are 0.08445 and 
0.9101, respectively. The value of the objective function is 10,228.1941, only marginally 
below the value of 10,228.2349 obtained using the earlier procedure. 

When the EWMA model is used, the estimation procedure is relatively simple. We 
set ® = 0, a = 1 — À, and B =A, and only one parameter, A, has to be estimated. In the 
data in Table 10.4, the value of A that maximizes the objective function in equation 
(10.13) is 0.9374 and the value of the objective function is 10,192.5104. 

For both GARCH(1,1) and EWMA, we can use the Solver routine in Excel to search 
for the values of the parameters that maximize the likelihood function. The routine works 
well provided we structure our spreadsheet so that the parameters we are searching for 
have roughly equal values. For example, in GARCH(1,1) we could let cells A1, A2, and 
A3 contain @ X 10°, 10a, and B. We could then set B1=A1/100,000, B2 = A2/10, and 
B3 = A3. We would then use B1, B2, and B3 for the calculations, but we would ask 
Solver to calculate the values of A1, A2, and A3 that maximize the likelihood function. 


10 See R. Engle and J. Mezrich, “GARCH for Groups,” Risk (August 1996): 36—40. 
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Table 10.5 Autocorrelations Before and After the 
Use of aGARCH Model 


Time Autocorr Autocorr 
Lag for u for w/o? 
1 0.183 —0.063 
2 0.385 —0.004 
3 0.160 —0.007 
4 0.301 0.022 
5 0.339 0.014 
6 0.308 —0.011 
7 0.329 0.026 
8 0.207 0.038 
9 0.324 0.041 
10 0.269 0.083 
11 0.431 —0.007 
12 0.286 0.006 
13 0.224 0.001 
14 0.121 0.017 
15 0.222 —0.031 


Sometimes, Solver gives a local maximum, so a number of different starting values for 
the parameter should be tested. 


10.9.3 How Good Is the Model? 


The assumption underlying a GARCH model is that volatility changes with the pas- 
sage of time. During some periods, volatility is relatively high; during other peri- 
ods, it is relatively low. To put this another way, when u? is high, there is a ten- 


2 98 E 2: 2 
dency for U1 Up ios +++ CO be high; when u? is low, there is a tendency for Uys 
W >, ... to be low. We can test how true this is by examining the autocorrelation structure 
of the ur. 


Let us assume that the u? do exhibit autocorrelation. If a GARCH model is working 
well, it should remove the autocorrelation. We can test whether it has done this by 
considering the autocorrelation structure for the variables u? / oO. If these show very little 
autocorrelation, our model for ©; has succeeded in explaining autocorrelations in the W. 

Table 10.5 shows results for the S&P 500 data. The first column shows the lags 
considered when the autocorrelation is calculated. The second column shows autocor- 
relations for u; the third column shows autocorrelations for u? / o?! The table shows 
that the autocorrelations are positive for u? for all lags between 1 and 15. In the case of 
u? / O°, some of the autocorrelations are positive and some are negative. They are much 
smaller in magnitude than the autocorrelations for W. 


For a series x;, the autocorrelation with a lag of k is the coefficient of correlation between x; 
and Xj4p. 
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The GARCH model appears to have done a good job in explaining the data. For a 
more scientific test, we can use what is known as the Ljung-Box statistic.'* If a certain 
series has m observations the Ljung-Box statistic is 


K 
m yy Wel, 
k=1 


where ç is the autocorrelation for a lag of k, K is the number of lags considered, and 


For K = 15, zero autocorrelation can be rejected with 95% confidence when the Ljung- 
Box statistic is greater than 25. 

From Table 10.5, the Ljung-Box statistic for the u? series is about 1,566. This is 
strong evidence of autocorrelation. For the w / o? series, the Ljung-Box statistic is 21.7, 
suggesting that the autocorrelation has been largely removed by the GAR CH model. 


10.10 Using GARCH(1,1) to Forecast Future Volatility 


The variance rate estimated at the end of day n—1 for day n, when GARCH(1,1) is 
used, is 


o = (1-a -BV +a + Bor 


n—1 


so that 


o — V, = au Vi) + B(o?_, - Vz) 


n-1 


On day n + t in the future, we have 


Org -V= (uy - V1) + BlOg -= V1) 
2 ; 2 
The expected value of u7, _, is that of o7, _,. Hence, 
Eloi, - Vi] = (@&+ PELO a - Vi] 


12See G. M. Ljung and G. E. P. Box, “On a Measure of Lack of Fit in Time Series Models,” Biometrica 
65 (1978): 297—303. 
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Figure 10.6 Expected Path for the Variance Rate When (a) Current Variance Rate Is Above 
Long-Term Variance Rate and (b) Current Variance Rate Is Below Long-Term Variance Rate 


where E denotes expected value. Using this equation repeatedly yields 


Ejo? — V] = (a + B)'(o- — vi) 
or 
Efo?, ] = Vi + (a+ B) (02 - V) (10.14) 


This equation forecasts the volatility on day n + t using the information available at the 
end of day n — 1. In the EWMA model, « + 6 = 1 and equation (10.14) show that the 
expected future variance rate equals the current variance rate. When a + P < 1, the final 
term in the equation becomes progressively smaller as f increases. Figure 10.6 shows the 
expected path followed by the variance rate for situations where the current variance 
rate is different from V,. As mentioned earlier, the variance rate exhibits mean rever- 
sion with a reversion level of V, and a reversion rate of 1 — a — P. Our forecast of the 
future variance rate tends toward V, as we look further and further ahead. This analysis 
emphasizes the point that we must have a+ B < 1 for a stable GARCH(1,1) process. 
When a + P > 1, the weight given to the long-term average variance is negative and the 
process is mean fleeing rather than mean reverting. 

For the S&P 500 data considered earlier, a + B = 0.9935 and V, = 0.0002075. Sup- 
pose that our estimate of the current variance rate per day is 0.0003. (This corresponds 
to a volatility of 1.732% per day.) In 10 days, the expected variance rate is 


0.0002075 + 0.9935'"(0.0003 — 0.0002075) = 0.0002942 


Volatility 237 


The expected volatility per day is y 0.0002942 or 1.72%, still well above the long-term 
volatility of 1.44% per day. However, the expected variance rate in 500 days is 


0.0002075 + 0.9935°°°(0.0003 — 0.0002075) = 0.0002110 


and the expected volatility per day is 1.45%, very close to the long-term volatility. 


10.10.1 Volatility Term Structures 
Suppose it is day n. Define 


V() = Elo?) 
and 
1 
=] 
= a+ 


so that equation (10.14) becomes 
VQ=V_+e“[VO) - Vv] 


V (f) is an estimate of the instantaneous variance rate in t days. The average variance rate 
per day between today and time T is 


ME ja 
F V (tdt = V, + -r VO) = V] 


As T increases, this approaches V,. Define o(T) as the volatility per annum that should 
be used to price a T-day option under GARCH(1,1). Assuming 252 days per year, o (T)? 
is 252 times the average variance rate per day, so that 


eat 


oT) = 252 { V, + = 170) - va} (10.15) 
aT 


The relationship between the volatilities of options and their maturities is referred 
to as the volatility term structure. The volatility term structure is usually calculated from 
implied volatilities, but equation (10.15) provides an alternative approach for estimating 
it from the GARCH(1,1) model. Although the volatility term structure estimated from 
GARCH(1,1) is not the same as that calculated from implied volatilities, it is often used to 
predict the way that the actual volatility term structure will respond to volatility changes. 
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Table 10.6 S&P 500 Volatility Term Structure Predicted from GARCH(1,1) 
Option life (days) 10 30 50 100 500 


Option volatility (% per annum) 27.36 27.10 26.87 26.35 24.32 


When the current volatility is above the long-term volatility, the GARCH(1,1) 
model estimates a downward-sloping volatility term structure. When the current volatil- 
ity is below the long-term volatility, it estimates an upward-sloping volatility term 
structure. In the case of the S&P 500 data, a = In(1/0.99351) = 0.006511 and V, = 
0.0002075. Suppose that the current variance rate per day, V (0), is estimated as 0.0003 
per day. It follows from equation (10.15) that 


= 9-006511 T 


o(T) = 252 |0.0002075 + + 


———— (0.0003 — 0.0002075) 
0.006511 T 


where T is measured in days. Table 10.6 shows the volatility per year for different values 
of T. 


10.10.2 Impact of Volatility Changes 


Equation (10.15) can be written as 


a 1— et (0(0)* _ 
P r+ 12e% (208 y,)) 


When o6(0) changes by Ao(0), o(T) changes by approximately 


1— eT oO) 
= sro) (10.16) 


Table 10.7 shows the effect of a volatility change on options of varying maturi- 
ties for our S&P 500 data. We assume as before that V (0) = 0.0003 so that o(0) = 
4/252 x y 0.0003 = 27.50%. The table considers a 100-basis-point change in the instan- 
taneous volatility from 27.50% per year to 28.50% per year. This means that Ac (0) = 0.01 
or 1%. 

Many financial institutions use analyses such as this when determining the exposure 
of their books to volatility changes. Rather than consider an across-the-board increase 
of 1% in implied volatilities when calculating vega, they relate the size of the volatility 
increase that is considered to the maturity of the option. Based on Table 10.7, a 0.97% 


Table 10.7 Impact of 1% Increase in the Instantaneous Volatility Predicted from GARCH(1,1) 
Option life (days) 10 30 50 100 500 


Increase in volatility (%) 0.97 0.92 0.87 0.77 0.33 
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volatility increase would be considered for a 10-day option, a 0.92% increase for a 30-day 
option, a 0.87% increase for a 50-day option, and so on. 


Summary 


In risk management, the daily volatility of a market variable is defined as the standard 
deviation of the percentage daily change in the market variable. The daily variance rate is 
the square of the daily volatility. Volatility tends to be much higher on trading days than 
on nontrading days. As a result, nontrading days are ignored in volatility calculations. It is 
tempting to assume that daily changes in market variables are normally distributed. In fact, 
this is far from true. Most market variables have distributions for percentage daily changes 
with much heavier tails than the normal distribution. The power law has been found to 
be a good description of the tails of many distributions that are encountered in practice. 

This chapter has discussed methods for attempting to keep track of the current level 
of volatility. Define u; as the percentage change in a market variable between the end 
of day i— 1 and the end of day i. The variance rate of the market variable (that is, the 
square of its volatility) is calculated as a weighted average of the te The key feature 
of the methods that have been discussed here is that they do not give equal weight 
to the observations on the u?. The more recent an observation, the greater the weight 
assigned to it. In the EWMA model and the GARCH(1,1) model, the weights assigned to 
observations decrease exponentially as the observations become older. The GARCH(1,1) 
model differs from the EWMA model in that some weight is also assigned to the long- 
run average variance rate. Both the EWMA and GARCH(1,1) models have structures 
that enable forecasts of the future level of variance rate to be produced relatively easily. 

Maximum likelihood methods are usually used to estimate parameters in 
GARCH(1,1) and similar models from historical data. These methods involve using an 
iterative procedure to determine the parameter values that maximize the chance or like- 
lihood that the historical data will occur. Once its parameters have been determined, a 
model can be judged by how well it removes autocorrelation from the ur. 

The GARCH(1,1) model can be used to estimate a volatility for options from his- 
torical data. This analysis is often used to calculate the impact of a shock to volatility on 
the implied volatilities of options of different maturities. 
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Practice Questions and Problems 
(Answers at End of Book) 


10.1 The volatility of an asset is 2% per day. What is the standard deviation of the 
percentage price change in three days? 

10.2 The volatility of an asset is 25% per annum. What is the standard deviation of 
the percentage price change in one trading day? Assuming a normal distribution 
with zero mean, estimate 95% confidence limits for the percentage price change 
in one day. 

10.3 Why do traders assume 252 rather than 365 days in a year when using volatilities? 

10.4 What is implied volatility? What does it mean if different options on the same 
asset have different implied volatilities? 

10.5 Suppose that observations on an exchange rate at the end of the past 11 days 
have been 0.7000, 0.7010, 0.7070, 0.6999, 0.6970, 0.7003, 0.6951, 0.6953, 0.6934, 
0.6923, and 0.6922. Estimate the daily volatility using both equation (10.2) and 
equation (10.4). 

10.6 The number of visitors to websites follows the power law in equation (10.1) with 
a = 2. Suppose that 1% of sites get 500 or more visitors per day. What percentage 
of sites get (a) 1,000 and (b) 2,000 or more visitors per day? 

10.7 Explain the exponentially weighted moving average (EWMA) model for estimat- 
ing volatility from historical data. 

10.8 What is the difference between the exponentially weighted moving average model 
and the GARCH(1,1) model for updating volatilities? 

10.9 The most recent estimate of the daily volatility of an asset is 1.5% and the price 
of the asset at the close of trading yesterday was $30.00. The parameter A in the 
EWMA model is 0.94. Suppose that the price of the asset at the close of trading 
today is $30.50. How will this cause the volatility to be updated by the EWMA 
model? 
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10.10 


10.11 


10.12 


10.13 


10.14 


10.15 


10.16 


10.17 


10.18 


A company uses an EWMA model for forecasting volatility. It decides to change 
the parameter A from 0.95 to 0.85. Explain the likely impact on the forecasts. 
Assume that an index at close of trading yesterday was 1,040 and the daily 
volatility of the index was estimated as 1% per day at that time. The parame- 
ters in a GARCH(1,1) model are œ = 0.000002, a = 0.06, and B = 0.92. If the 
level of the index at close of trading today is 1,060, what is the new volatility 
estimate? 

The most recent estimate of the daily volatility of the dollar-sterling exchange 
rate is 0.6% and the exchange rate at 4:00 P.M. yesterday was 1.5000. The param- 
eter A in the EWMA model is 0.9. Suppose that the exchange rate at 4:00 P.M. 
today proves to be 1.4950. How would the estimate of the daily volatility be 
updated? 

A company uses the GAR CH(1,1) model for updating volatility. The three param- 
eters are œ, a, and B. Describe the impact of making a small increase in each of 
the parameters while keeping the others fixed. 

The parameters of a GARCH(1,1) model are estimated as œ = 0.000004, œ = 
0.05, and B = 0.92. What is the long-run average volatility and what is the 
equation describing the way that the variance rate reverts to its long-run aver- 
age? If the current volatility is 20% per year, what is the expected volatility in 
20 days? 

Suppose that the daily volatility of the FTSE 100 stock index (measured in pounds 
sterling) is 1.8% and the daily volatility of the dollar—sterling exchange rate is 0.9%. 
Suppose further that the correlation between the FTSE 100 and the dollar—sterling 
exchange rate is 0.4. What is the volatility of the FTSE 100 when it is translated 
to US. dollars? Assume that the dollar—sterling exchange rate is expressed as the 
number of U.S. dollars per pound sterling. (Hint: When Z = XY, the percentage 
daily change in Z is approximately equal to the percentage daily change in X plus 
the percentage daily change in Y.) 

Suppose that GARCH(1,1) parameters have been estimated as œ = 0.000003, 0 = 
0.04, and B = 0.94. The current daily volatility is estimated to be 1%. Estimate the 
daily volatility in 30 days. 

Suppose that GARCH(1,1) parameters have been estimated as œ = 0.000002, 0 = 
0.04, and B = 0.94. The current daily volatility is estimated to be 1.3%. Estimate 
the volatility per annum that should be used to price a 20-day option. 


Further Questions 


Suppose that observations on a stock price (in dollars) at the end of each of 15 
consecutive days are as follows: 


30.2, 32.0, 31.1, 30.1, 30.2, 30.3, 30.6, 30.9, 30.5, 31.1, 31.3, 30.8, 30.3, 29.9, 29.8 


Estimate the daily volatility using both equation (10.2) and equation (10.4). 
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10.19 


10.20 


10.21 


10.22 


10.23 


Suppose that the price of an asset at close of trading yesterday was $300 and its 
volatility was estimated as 1.3% per day. The price at the close of trading today is 
$298. Update the volatility estimate using 

(a) The EWMA model with À = 0.94 

(b) The GARCH(1,1) model with œ = 0.000002, a = 0.04, and B = 0.94. 

An Excel spreadsheet containing over 900 days of daily data on a number of 
different exchange rates and stock indices can be downloaded from the author’s 
website: www-2.rotman.utoronto.ca/~hull/data. Choose one exchange rate and 
one stock index. Estimate the value of À in the EWMA model that minimizes the 
value of 


where v; is the variance forecast made at the end of day i — 1 and 6; is the variance 

calculated from data between day i and day i + 25. Use the Solver tool in Excel. 

To start the EWMA calculations, set the variance forecast at the end of the first 

day equal to the square of the return on that day. 

Suppose that the parameters in a GARCH(1,1) model are « = 0.03, B = 0.95, and 

œ = 0.000002. 

(a) What is the long-run average volatility? 

(b) If the current volatility is 1.5% per day, what is your estimate of the volatility 
in 20, 40, and 60 days? 

(c) What volatility should be used to price 20-, 40-, and 60-day options? 

(d) Suppose that there is an event that increases the volatility from 1.5% per day 
to 2% per day. Estimate the effect on the volatility in 20, 40, and 60 days. 

(e) Estimate by how much the event increases the volatilities used to price 20-, 
40-, and 60-day options. 

Estimate parameters for the EWMA and GARCH(1,1) models on the euro-USD 

exchange rate data between July 27, 2005, and July 27, 2010. These data can be 

found on the author’s website: www-2.rotman.utoronto.ca/~hull/data. 

The probability that the loss from a portfolio will be greater than $10 million in 

one month is estimated to be 5%. 

(a) What is the loss that has a 1% chance of being exceeded, assuming the change 
in value of the portfolio is normally distributed with zero mean? 

(b) What is the loss that has a 1% chance of being exceeded, assuming that the 
power law applies with a = 3? 
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uppose that a company has an exposure to two different market variables. In the 

case of each variable, it gains $10 million if there is a one-standard-deviation 

increase and loses $10 million if there is a one-standard-deviation decrease. If 
changes in the two variables have a high positive correlation, the company’s total exposure 
is very high; if they have a correlation of zero, the exposure is less but still quite large; if 
they have a high negative correlation, the exposure is quite low because a loss on one 
of the variables is likely to be offset by a gain on the other. This example shows that it 
is important for a risk manager to estimate correlations between the changes in market 
variables as well as their volatilities when assessing risk exposures. 

This chapter explains how correlations can be monitored in a similar way to volatil- 
ities. It also covers what are known as copulas. These are tools that provide a way of 
defining a correlation structure between two or more variables, regardless of the shapes 
of their probability distributions. Copulas have a number of applications in risk man- 
agement. The chapter shows how a copula can be used to create a model of default 
correlation for a portfolio of loans. This model is used in the Basel II capital requirements. 


11.1 Definition of Correlation 


The coefficient of correlation, p, between two variables V} and V, is defined as 


_ EV V) — EVJE) 
P = ~~ SDSD) ron) 
1 2 
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where E(.) denotes expected value and SD(.) denotes standard deviation. If there is no 
correlation between the variables, E(V, V2) = E(V,)E(V>) and p = 0. If V, = V,, both 
the numerator and the denominator in the expression for p equal the variance of V}. As 
we would expect, p = 1 in this case. 

The covariance between V, and V, is defined as 


cov(V,, Vz) = E(V, V3) — E(V,) E(V3) (11.2) 


so that the correlation can be written 


_ co(n, V) 
P = SD(V,)SD(Vy) 


Although it is easier to develop intuition about the meaning of a correlation than a 
covariance, it is covariances that will prove to be the fundamental variables of our analysis. 
An analogy here is that variance rates were the fundamental variables for the EWMA 
and GARCH models in Chapter 10, even though it is easier to develop intuition about 
volatilities. 


11.1.1 Correlation vs. Dependence 


Two variables are defined as statistically independent if knowledge about one of them 
does not affect the probability distribution for the other. Formally, V} and V, are inde- 
pendent if: 


SVV = x) = f (V2) 


for all x where f(.) denotes the probability density function and | is the symbol denoting 
“conditional on.” 

If the coefficient of correlation between two variables is zero, does this mean that 
there is no dependence between the variables? The answer is no. We can illustrate this 
with a simple example. Suppose that there are three equally likely values for V;:—1, 0, 
and +1.1f V, = —1 or V, = +1, then V, = +1.If V} = 0, then V, = 0. In this case, there 
is clearly a dependence between V; and V3. If we observe the value of V,, we know the 
value of V,. Also, a knowledge of the value of V, will cause us to change our probability 
distribution for V,. However, because E(V, V2) = 0 and E(V,) = 0, it is easy to see that 
the coefficient of correlation between V; and V, is zero. 

This example emphasizes the point that the coefficient of correlation measures one 
particular type of dependence between two variables. This is linear dependence. There are 
many other ways in which two variables can be related. We can characterize the nature of 
the dependence between V; and V, by plotting E(V,) against V,;. Three examples are 
shown in Figure 11.1. Figure 11.1(a) shows linear dependence where the expected value 
of V, depends linearly on V,. Figure 11.1(b) shows a V-shaped relationship between the 
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EV) EV) 


(a) (b) 


EV) 


(c) 
Figure 11.1 Examples of Ways in Which V, Can Be Dependent on V, 


expected value of V, and the value of V,. (This is similar to the simple example just 
considered; a symmetrical V-shaped relationship, however strong, leads to zero coeffi- 
cient of correlation.) Figure 11.1(c) shows a type of dependence that is often seen when 
V, and V, are percentage changes in financial variables. For the values of V, normally 
encountered, there is very little relation between V} and V,. However, extreme values 
of V; tend to lead to extreme values of V,. (This could be consistent with correlations 
increasing in stressed market conditions.) 

Another aspect of the way in which V, depends on V, is found by examining the 
standard deviation of V, conditional on V}. As we will see later, this is constant when 
V, and V, have a bivariate normal distribution. But, in other situations, the standard 
deviation of V, is liable to depend on the value of V}. 


11.2 Monitoring Correlation 


Chapter 10 explained how exponentially weighted moving average and GARCH models 
can be developed to monitor the variance rate of a variable. Similar approaches can be 
used to monitor the covariance rate between two variables. The variance rate per day 
of a variable is the variance of daily returns. Similarly, the covariance rate per day between 
two variables is defined as the covariance between the daily returns of the variables. 
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Suppose that X; and Y, are the values of two variables, X and Y, at the end of day i. 
The returns on the variables on day i are 


X; — X Y= Y; 1 


X: — — 
i ka Yi Ya 


j= 


The covariance rate between X and Y on day n is from equation (11.2): 
cov, = EX Yn) = E(x EY) 


In Section 10.5, we explained that risk managers assume that expected daily returns are 
zero when the variance rate per day is calculated. They do the same when calculating 
the covariance rate per day. This means that the covariance rate per day between X and 
Y on day n is assumed to be 


COV, = E(XnYn) 


Using equal weights for the last m observations on x; and y; gives the estimate 


m 


1 
cov, = — Xn iV ni 11.3 
5 p ae (11.3) 


A similar weighting scheme for variances gives an estimate for the variance rate on day 
n for variable X as 


Var = 


m 
» 2 
X 
n—i 
i=1 


S| 


and for variable Y as 


m 


_! 2 
Val yn = m > ) 
i=1 


The correlation estimate on day n is 


11.2.1 EWMA 


Most risk managers would agree that observations from long ago should not have as much 
weight as recent observations. In Chapter 10, we discussed the use of the exponentially 
weighted moving average (EWMA) model for variances. We saw that it leads to weights 
that decline exponentially as we move back through time. A similar weighting scheme can 
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be used for covariances. The formula for updating a covariance estimate in the EWMA 
model is similar to that in equation (10.8) for variances: 


cov, = cov, + (1 = NX pV n-1 


A similar analysis to that presented for the EWMA volatility model shows that the weight 


given to x,_;),—; declines as i increases (i.e., as we move back through time). The lower 


n— 
the value of A, the greater the weight that is given to recent observations. 


Example 11.1 


Suppose that A = 0.95 and that the estimate of the correlation between two variables X 
and Y on day n—1 is 0.6. Suppose further that the estimates of the volatilities for X 
and Y on day n — 1 are 1% and 2%, respectively. From the relationship between correla- 
tion and covariance, the estimate of the covariance rate between X and Y on day n— 1 
is 


0.6 X 0.01 X 0.02 = 0.00012 


Suppose that the percentage changes in X and Y on day n—1 are 0.5% and 2.5%, 
respectively. The variance rates and covariance rate for day n would be updated as 
follows: 


o2 „= 0.95 x 0.017 + 0.05 x 0.0057 = 0.00009625 


o7 „ = 0.95 X 0.02? + 0.05 x 0.0257 = 0.00041125 


cov, = 0.95 X 0.00012 + 0.05 x 0.005 x 0.025 = 0.00012025 


The new volatility of X is /0.00009625 = 0.981%, and the new volatility of Y is 
0.00041125 = 2.028%. The new correlation between X and Y is 


0.00012025 


———_——— = 0.6044 
0.00981 x 0.02028 


11.2.2 GARCH 


GARCH models can also be used for updating covariance rate estimates and forecasting 
the future level of covariance rates. For example, the GARCH(1,1) model for updating 
a covariance rate between X and Y is 


cov, = a+ OX -1V n= 1 + Bcov,,_ 
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Table 11.1 A Correlation Matrix 


1 P2 Pia = Pin 
Pa 1 P3 Pon 
P31 P32 1 P3n 
Pri Pn2 Png ii 1 


pj is the correlation between variable í and variable j. 


This formula, like its counterpart in equation (10.10) for updating variances, gives 
some weight to a long-run average covariance, some to the most recent covariance esti- 
mate, and some to the most recent observation on covariance (which is x,_,y,,_;). The 
long-term average covariance rate is @/(1 — a — B). Formulas similar to those in equa- 
tions (10.14) and (10.15) can be developed for forecasting future covariance rates and 
calculating the average covariance rate during a future time period. 


11.3 Correlation and Covariance Matrices 


Once variances, covariances, and correlations have been calculated for a set of variables, 
correlation matrices and covariance matrices can be produced. 

A correlation matrix is shown in Table 11.1. It displays coefficients of correlation. 
Because a variable is always perfectly correlated with itself, the diagonal elements of a 
correlation matrix are always 1. Furthermore, because p; = p; the correlation matrix is 
symmetric. 

It is often more convenient to work with a variance-covariance matrix. When i £ j, 
the (i,j) element in this matrix is the covariance between variable i and variable j. When 
i = j,it is the variance of variable i. (See Table 11.2.) 


Table 11.2 A Variance-Covariance Matrix 


var, COV12 COV 13 aie COV)», 

COV91 vary COV23 vee COV), 
COV31 COV39 vars ssa COV3,, 
COV) 1 COV,2 COV) 3 sists var, 


cov; is the covariance between variable í and variable j and var; = cov; is the 


ij 
variance of variable i. 
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11.3.1 Consistency Condition for Covariances 


Not all variance-covariance matrices are internally consistent. The condition for an 
N x N variance-covariance matrix, Q, to be internally consistent is 


w!Qw > 0 (11.4) 


T is the transpose of w. A matrix that satisfies this 


for all N X 1 vectors w where w 
property is known as posifive-semidefinite. 

To understand why the condition in equation (11.4) must hold, suppose that w is 
the (column) vector (w4, w2, ..., wn). The expression w! Qw is the variance rate of a 
portfolio where an amount w; is invested in market variable i. As such, it cannot be 
negative. 

To ensure that a positive-semidefinite matrix is produced, variances and covariances 
should be calculated consistently. For example, if variance rates are calculated by giving 
equal weight to the last m data items, the same should be done for covariance rates. 
If variance rates are updated using an EWMA model with à = 0.94, the same should 
be done for covariance rates. Using a GARCH model to update a variance-covariance 
matrix in a consistent way is trickier and requires a multivariate model. 

An example of a variance-covariance matrix that is not internally consistent is 


1 0 0.9 
0 1 0.9 
0.9 0.9 1 


The variance of each variable is 1.0 and so the covariances are also coefficients of 
correlation in this case. The first variable is highly correlated with the third variable, and 
the second variable is also highly correlated with the third variable. However, there is no 
correlation at all between the first and second variables. This seems strange. When we 
set wT equal to (1, 1, —1), we find that the condition in equation (11.4) is not satisfied, 
proving that the matrix is not positive-semidefinite. 

If we make a small change to a positive-semidefinite matrix that is calculated from 
observations on three variables (e.g., for the purposes of doing a sensitivity analysis), it 
is likely that the matrix will remain positive-semidefinite. However, if we do the same 
thing for observations on 100 variables, we have to be much more careful. An arbitrary 
small change to a positive-semidefinite 100 X 100 matrix is quite likely to lead to it no 
longer being positive-semidefinite. 


'See R. Engle and J. Mezrich, “GARCH for Groups,” Risk (August 1996): 36—40, for a discussion of 
alternative approaches. 
?Tt can be shown that the condition for a 3 X 3 matrix of correlations to be internally consistent is 


Pio + P? + P53 — 2P12P13P23 < 1 


where pj is the coefficient of correlation between variables i and j. 
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11.4 Multivariate Normal Distributions 


Multivariate normal distributions are well understood and relatively easy to deal with. As 
we will explain in the next section, they can be useful tools for specifying the correlation 
structure between variables, even when the distributions of the variables are not normal. 

We start by considering a bivariate normal distribution where there are only two 
variables, V; and V3. Suppose that we know that V; has some value. Conditional on this, 
the value of V, is normal with mean 


Vi =p 
+ po, ——— 
H2 T PO» o, 


and standard deviation 
o y1- p? 


Here p and p, are the unconditional means of V, and V,, 6, and ©, are their 
unconditional standard deviations, and p is the coefficient of correlation between V} 
and V,. Note that the expected value of V, conditional on V; is linearly dependent on 
the value of V,. This corresponds to Figure 11.1(a). Also, the standard deviation of V, 
conditional on the value of V} is the same for all values of V}. 

Software for calculating the cumulative bivariate normal distribution is on the 
author’s website: 


www-2.rotman.utoronto.ca/~hull/riskman 


11.4.1 Generating Random Samples from Multivariate Normal Distributions 


Most programming languages have routines for sampling a random number between 
zero and one, and many have routines for sampling from a normal distribution? 

When samples £} and £, from a bivariate normal distribution (where both variables 
have mean zero and standard deviation one) are required, the usual procedure involves 
first obtaining independent samples z, and z, from a univariate standard normal dis- 
tribution (i.e., a normal distribution with mean zero and standard deviation one) are 
obtained. The required samples €, and £, are then calculated as follows: 


where is the coefficient of correlation in the bivariate normal distribution. 


3In Excel, the instruction =NORMSINV(RAND() gives a random sample from a normal distribu- 
tion. 
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Consider next a situation where we require samples from a multivariate normal dis- 
tribution (where all variables have mean zero and standard deviation one) and the coeffi- 
cient of correlation between variable i and variable j is p,;. We first sample n independent 
variables z, (1 < i < n) from univariate standard normal distributions. The required sam- 
ples are g; (1 < i < n), where 


m 
II 


Apk (11.5) 
k=1 


and the Q; are parameters chosen to give the correct variances and the correct correla- 
tions for €;. For 1 < j < i, we must have 


i 

DP 
yu. = 1 
k=1 


and, for all j < i, 


Jj 
» Hip Qin = Pij 
= 


The first sample, €,, is set equal to z4. These equations can be solved so that £, is 
calculated from z, and z3, €, is calculated from z4, z2, and z3, and so on. The procedure 
is known as the Cholesky decomposition. (See Problem 11.9.) 

If we find ourselves trying to take the square root of a negative number when using 
the Cholesky decomposition, the variance-covariance matrix assumed for the variables 
is not internally consistent. As explained in Section 11.3.1, this is equivalent to saying 
that the matrix is not positive-semidefinite. 


11.4.2 Factor Models 


Sometimes the correlations between normally distributed variables are defined using a 
factor model. Suppose that U,, U;,..., Ux, have standard normal distributions (i.e., nor- 
mal distributions with mean zero and standard deviation one). In a one-factor model, 
each U,(1 < i < N) has a component dependent on a common factor, F, and a compo- 
nent that is uncorrelated with the other variables. Formally, 


U, = qF + 4/1- Z, (11.6) 


where F and the Z, have standard normal distributions and a; is a constant between 
—1 and +1. The Z; are uncorrelated with each other and uncorrelated with F. The 
coefficient of Z; is chosen so that U; has a mean of zero and a variance of one. In 
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this model, all the correlation between U; and U; arises from their dependence on the 
common factor, F. The coefficient of correlation beves U; and U; is a;a;. 

A one-factor model imposes some structure on the correlations a has the advantage 
that the resulting covariance matrix is always positive-semidefinite. Without assuming a 
factor model, the number of correlations that have to be estimated for the N variables is 
N(N — 1)/2. With the one-factor model, we need only estimate N parameters: a,,d5,..., 
ayn. An example of a one-factor model from the world of investments is the capital asset 
pricing model where the return on a stock has a component dependent on the return 
from the market and an idiosyncratic (nonsystematic) component that is independent of 
the return on other stocks (see Section 1.3). 

The one-factor model can be extended to a two-factor, three-factor, or M-factor 
model. In the M-factor model 


U, = ay Fy + a9Fo + ... + ayy Fy + (11.7) 


The factors, F}, F5,..., Fy have uncorrelated standard normal distributions and the Z; 
are uncorrelated both with each other and with the factors. In this case, the correlation 
between U; and U, is 


M 


J / Aim Aim 


m=1 


11.5 Copulas 


Consider two correlated variables, V; and V,. The marginal distribution of V, (sometimes 
also referred to as the unconditional distribution) is its distribution assuming we know 
nothing about V3; similarly, the marginal distribution of V, is its distribution assuming 
we know nothing about V}. Suppose we have estimated the marginal distributions of V, 
and V,. How can we make an assumption about the correlation structure between the 
two variables to define their joint distribution? 

If the marginal distributions of V, and V, are normal, a convenient and easy-to- 
work-with assumption is that the joint distribution of the variables is bivariate normalt 
(The correlation structure between the variables is then as described in Section 11.4.) 
Similar assumptions are possible for some other marginal distributions. But often there 
is no natural way of defining a correlation structure between two marginal distributions. 
This is where copulas come in. 


4 Although the bivariate normal assumption is a convenient one, it is not the only one that can be 
made. There are many other ways in which two normally distributed variables can be dependent on 
each other. For example, we could have Vy = V} for —k < V; < kand V) = —V, otherwise. See also 
Problem 11.11. 
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12 
(a) (b) 
Figure 11.2 Triangular Distributions for V} and V3 


As an example of the application of copulas, suppose that variables V, and V, have 
the triangular probability density functions shown in Figure 11.2. Both variables have 
values between 0 and 1. The density function for V, peaks at 0.2. The density function 
for V, peaks at 0.5. For both density functions, the maximum height is 2.0 (so that the 
area under the density function is 1.0). To use what is known as a Gaussian copula, we 
map V, and V, into new variables U} and U, that have standard normal distributions. 
(A standard normal distribution is a normal distribution with mean zero and standard 
deviation one.) The mapping is accomplished on a percentile-to-percentile basis. The 
one-percentile point of the V, distribution is mapped to the one-percentile point of 
the U; distribution; the 10-percentile point of the V, distribution is mapped to the 10- 
percentile point of the U, distribution; and so on. V, is mapped into U, in a similar way. 
Table 11.3 shows how values of V, are mapped into values of U}. Table 11.4 similarly 
shows how values of V, are mapped into values of U,. Consider the V, = 0.1 calculation 
in Table 11.3. The cumulative probability that V; is less than 0.1 is (by calculating areas 


Table 11.3 Mapping of V, That Has the Triangular 
Distribution in Figure 11.2(a) to U} That Has a 
Standard Normal Distribution 


V, Percentile U, 

Value of Distribution Value 
0.1 5.00 —1.64 
0.2 20.00 —0.84 
0.3 38.75 —0.29 
0.4 55.00 0.13 
0.5 68.75 0.49 
0.6 80.00 0.84 
0.7 88.75 1.21 
0.8 95.00 1.64 


0.9 98.75 2.24 
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Table 11.4 Mapping of V, That Has the Triangular 
Distribution in Figure 11.2(b) to U, That Has a 
Standard Normal Distribution 


V, Percentile U, 

Value of Distribution Value 
0.1 2.00 —2.05 
0.2 8.00 —1.41 
0.3 18.00 —0.92 
0.4 32.00 —0.47 
0.5 50.00 0.00 
0.6 68.00 0.47 
0.7 82.00 0.92 
0.8 92.00 1.41 
0.9 98.00 2.05 


Table 11.5 Cumulative Joint Probability Distribution for V} and V, in the Gaussian Copula 
Model (Correlation parameter = 0.5. Table shows the joint probability that V; and V3 are less than 
the specified values.) 


vı 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 


0.1 0.006 0.017 0.028 0.037 0.044 0.048 0.049 0.050 0.050 
0.2 0.013 0.043 0.081 0.120 0.156 0.181 0.193 0.198 0.200 
0.3 0.017 0.061 0.124 0.197 0.273 0.331 0.364 0.381 0.387 
0.4 0.019 0.071 0.149 0.248 0.358 0.449 0.505 0.535 0.548 
0.5 0.019 0.076 0.164 0.281 0.417 0.537 0.616 0.663 0.683 
0.6 0.020 0.078 0.173 0.301 0.456 0.600 0.701 0.763 0.793 
0.7 0.020 0.079 0.177 0.312 0.481 0.642 0.760 0.837 0.877 
0.8 0.020 0.080 0.179 0.318 0.494 0.667 0.798 0.887 0.936 
0.9 0.020 0.080 0.180 0.320 0.499 0.678 0.816 0.913 0.970 


of triangles) 0.5 x 0.1 x 1 = 0.05 or 5%. The value 0.1 for V, therefore gets mapped to 
the five-percentile point of the standard normal distribution. This is —1.643 

The variables, U, and U,, have normal distributions. We assume that they are jointly 
bivariate normal. This in turn implies a joint distribution and a correlation structure 
between V, and V,. The essence of the copula is therefore that, instead of defining a 
correlation structure between V; and V, directly, we do so indirectly. We map V, and 
V, into other variables that have well-behaved distributions and for which it is easy to 
define a correlation structure. 

Suppose that we assume that the correlation between U, and U, is 0.5. The joint 
cumulative probability distribution between V} and V, is shown in Table 11.5. To illus- 
trate the calculations, consider the first one where we are calculating the probability that 
V, < 0.1 and V, < 0.1. From Tables 11.3 and 11.4, this is the same as the probability that 


>It can be calculated using Excel: NORMSINV(0.05) = —1.64. 
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Figure 11.3 The Way in Which a Copula Model Defines a Joint Distribution 


U, < —1.64 and U, < —2.05. From the cumulative bivariate normal distribution, this is 
0.006 when p = 0.5 (Note that the probability would be only 0.02 x 0.05 = 0.001 if 
p=0.) 

The correlation between U, and U, is referred to as the copula correlation. This is not, 
in general, the same as the coefficient of correlation between V; and V,. Because U} and 
U, are bivariate normal, the conditional mean of U, is linearly dependent on U, and the 
conditional standard deviation of U, is constant (as discussed in Section 11.4). However, 
a similar result does not in general apply to V, and V3. 


11.5.1 Expressing the Approach Algebraically 


The way in which a Gaussian copula defines a joint distribution is illustrated in Fig- 
ure 11.3. For a more formal description of the model, suppose that G, and G, are the 
cumulative marginal (i.e., unconditional) probability distributions of V} and V,. We map 
Vi =v, to U, =u, and V, = v, to U, = up so that 


Gi (v) = N (u) 


é An Excel function for calculating the cumulative bivariate normal distribution is on the author’s 
website: www-2.rotman.utoronto.ca/~hull/riskman. 
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and 
G,(v2) = N(u) 
where N is the cumulative normal distribution function. This means that 


u = NG (v)] Uy = N [G (v>)] 
vı = G '[N(u)] v = G5! [N(u)] 


The variables U, and U, are then assumed to be bivariate normal. The key property 
of a copula model is that it preserves the marginal distributions of V} and V, (however 
unusual these may be) while defining a correlation structure between them. 


11.5.2 Other Copulas 


The Gaussian copula is just one copula that can be used to define a correlation structure 
between V, and V,. There are many other copulas leading to many other correlation 
structures. One that is sometimes used is the Student’s t-copula. This works in the same 
way as the Gaussian copula except that the variables U} and U, are assumed to have 
a bivariate Student’s t-distribution instead of a bivariate normal distribution. To sample 
from a bivariate Student’s t-distribution with f degrees of freedom and correlation p, the 
steps are as follows: 


1. Sample from the inverse chi-square distribution to get a value y. (In Excel, the CHI- 
INV function can be used. The first argument is RAND() and the second is f.) 

2. Sample from a bivariate normal distribution with correlation p as described in 
Section 11.4. 

3. Multiply the normally distributed samples by VF Ix 


11.5.3 Tail Dependence 


Figure 11.4 shows plots of 5,000 random samples from a bivariate normal distribution, 
while Figure 11.5 does the same for the bivariate Student’s t. The correlation parameter 
is 0.5 and the number of degrees of freedom for the Student’s t is 4. Define a tail value 
of a distribution as a value in the left or right 1% tail of the distribution. There is a 
tail value for the normal distribution when the variable is greater than 2.33 or less than 
—2.33. Similarly, there is a tail value in the Student’s t-distribution when the value of the 
variable is greater than 3.75 or less than —3.75. Vertical and horizontal lines in the figures 
indicate when tail values occur. The figures illustrate that it is much more common for the 
two variables to have tail values at the same time in the bivariate Student’s t-distribution 
than in the bivariate normal distribution. To put this another way, the tail dependence is 
higher in a bivariate Student’s t-distribution than in a bivariate normal distribution. We 
made the point earlier that correlations between market variables tend to increase in 
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54 


Figure 11.4 Five Thousand Random Samples from a Bivariate Normal Distribution 


extreme market conditions, so that Figure 11.1(c) is sometimes a better description of 
the correlation structure between two variables than Figure 11.1(a). This has led some 
researchers to argue that the Student’s t-copula provides a better description of the joint 
behavior of two market variables than the Gaussian copula. 


=104 


Figure 11.5 Five Thousand Random Samples from a Bivariate Student’s t-distribution with Four 
Degrees of Freedom 
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11.5.4 Multivariate Copulas 


Copulas can be used to define a correlation structure between more than two variables. 
The simplest example of this is the multivariate Gaussian copula. Suppose that there are N 
variables, V,, V>,..., Vy and that we know the marginal distribution of each variable. For 
each i (1 <i < N), we transform V; into U; where U, has a standard normal distribution. 
(As described earlier, the transformation is accomplished on a percentile-to-percentile 
basis.) We then assume that the U, have a multivariate normal distribution. 


11.5.5 A Factor Copula Model 


In multivariate copula models, analysts often assume a factor model for the correlation 
structure between the U;. When there is only one factor, equation (11.6) gives 


U, = qF + 4/1- Z, (11.8) 


where F and the Z; have standard normal distributions. The Z; are uncorrelated with 
each other and with F. 

Other factor copula models are obtained by choosing F and the Z; to have other 
zero-mean unit-variance distributions. We discuss this further in connection with credit 
risk in the next section. These distributional choices affect the nature of the dependence 
between the U-variables and therefore that between the V-variables. 


11.6 Application to Loan Portfolios: Vasicek’s Model 


We now present an application of the one-factor Gaussian copula model that will prove 
useful in understanding the Basel II capital requirements, which are discussed in Chap- 
ter 15. Suppose a bank has a large portfolio of loans where the probability of default 
per year for each loan is 1%. If the loans default independently of each other, we would 
expect the default rate to be almost exactly 1% every year. In practice, loans do not default 
independently of each other. They are all influenced by macroeconomic conditions. As 
a result, in some years the default rate is high whereas in others it is low. This is illus- 
trated by Table 11.6, which shows the default rate for all rated companies between 1970 
and 2016. The default rate varies from a low of 0.088% in 1979 to a high of 4.996% 
in 2009. Other high-default-rate years were 1970, 1989, 1990, 1991, 1999, 2000, 2001, 
2002, 2008, and 2016. 

To model the defaults of the loans in a portfolio, we define T; as the time when 
company i defaults. (There is an implicit assumption that all companies will default 
eventually—but the default may happen a long time, perhaps even hundreds of years, 
in the future.) We make the simplifying assumption that all loans have the same 
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Table 11.6 Annual Percentage Default Rate for All Rated Companies, 1970—2016 


Year Default Rate Year Default Rate Year Default Rate 
1970 2.631 1986 1.830 2002 2.924 
1971 0.286 1987 1.423 2003 1.828 
1972 0.453 1988 1.393 2004 0.834 
1973 0.456 1989 2.226 2005 0.647 
1974 0.275 1990 3.572 2006 0.593 
1975 0.361 1991 2.803 2007 0.349 
1976 0.176 1992 1.337 2008 2.507 
1977 0.354 1993 0.899 2009 4.996 
1978 0.354 1994 0.651 2010 1.232 
1979 0.088 1995 0.899 2011 0.906 
1980 0.344 1996 0.506 2012 1.230 
1981 0.162 1997 0.616 2013 1.232 
1982 1.040 1998 1.137 2014 0.939 
1983 0.900 1999 2.123 2015 1.732 
1984 0.869 2000 2.455 2016 2.149 
1985 0.952 2001 3.679 


Source: Moody’. 


unconditional cumulative probability distribution for the time to default and define PD 


as the probability of default by time T: 
PD = Prob(T;, < T). 


The Gaussian copula model can be used to define a correlation structure between the 
times to default of the loans. Following the procedure we have described, each time 
to default T; is mapped to a variable U, that has a standard normal distribution on a 
percentile-to-percentile basis. 

We assume the factor model in equation (11.8) for the correlation structure is 
between the U, and make the simplifying assumption that the a; are all the same and 


equal to a so that: 
U, = aF + V1- &@ Z; 
As in equation (11.8), the variables F and Z; have independent standard normal dis- 
tributions. The copula correlation between each pair of loans is in this case the same. 
It is 
2 


p=a 


so that the expression for U; can be written 


U, = pF + V1- pZ; (11.9) 
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Define the “worst case default rate,’ WCDR (T, X), as the default rate (i.e., percent- 
age of loans defaulting) during time T that will not be exceeded with probability X. (In 
many applications T will be one year.) As shown in what follows, the assumptions we 
have made lead to 


(11.10) 


(=> + ann) 
WCDR(T, X) = N| —— 


Vi=p 

This is a strange-looking result, but a very important one. It was first developed by 
Vasicek in 1987.’ Nand N`! are the cumulative normal and inverse cumulative normal 
distribution functions. They can be calculated using the NORMSDIST and NORM- 
SINV functions in Excel. Note that if p = 0, the loans default independently of each 
other and WCDR = PD. As p increases, WCDR increases. 
Example 11.2 


Suppose that a bank has a large number of loans to retail customers. The one-year prob- 
ability of default for each loan is 2% and the copula correlation parameter, p, in Vasicek’s 
model is estimated as 0.1. In this case, 


WCDR(1, 0.999) = N 
1=6.1 


N7-1(0,02) + iNew) er 


showing that the 99.9% worst case one-year default rate is 12.8%. 


11.6.1 Proof of Vasicek’s Result 


From the properties of the Gaussian copula model 
PD = Prob(T, < T) = Prob(U; < U) 
where 
U = N ' [PD] (11.11) 
The probability of default by time T depends on the value of the factor, F, in equation 
(11.9). The factor can be thought of as an index of macroeconomic conditions. If F 


is high, macroeconomic conditions are good. Each U, will then tend to be high and 
the corresponding T; will therefore also tend to be high, meaning that the probability 


7See O. Vasicek, “Probability of Loss on a Loan Portfolio” (Working Paper, KMV, 1987). Vasicek’s 
results were published in Risk in December 2002 under the title “Loan Portfolio Value.” 
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of an early default is low and therefore Prob(T, < T) is low. If F is low, macroeconomic 
conditions are bad. Each U, and the corresponding T; will then tend to be low so that the 
probability of an early default is high. To explore this further, we consider the probability 
of default conditional on F. 

From equation (11.9), 


1 V1—p 


The probability that U; < U conditional on the factor value, F, is 


U — 4/pF U — 4/pF 
Prob(U, < U|F) = Prob (z < L =N (=) 
V1-—p V/1-—p 


This is also Prob(T, < T|F) so that 
U — ,/pF 
Prob(T; < TIP = N| ——— (11.12) 
From equation (11.11) this becomes 


Prob(T, < T|F) = ~( (11.13) 


N71(PD) — =) 
vi=p 


For a large portfolio of loans with the same PD, where the copula correlation for each pair 
of loans is p, this equation provides a good estimate of the percentage of loans defaulting 
by time T conditional on F. We will refer to this as the default rate. 

As F decreases, the default rate increases. How bad can the default rate become? 
Because F has a normal distribution, the probability that F will be less than N7!(Y) is 
Y. There is therefore a probability of Y that the default rate will be greater than 


N! (PD) — pN (Y) 
<6 (a a, Aes 
Vi=p 
The default rate that we are X% certain will not be exceeded in time T is obtained by 


substituting Y = 1 — X into the preceding expression. Because N7! (X) = —N7!(1 — X), 
we obtain equation (11.10). 
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11.6.2 Estimating PD and p 


The maximum likelihood methods explained in Chapter 10 can be used to estimate PD 
and p from historical data on default rates. We used equation (11.10) to calculate a high 
percentile of the default rate distribution, but it is actually true for all percentiles. If DR 
is the default rate and G(DR) is the cumulative probability distribution function for DR, 
equation (11.10) shows that 


(=> + a 
DR = N| —— r 
Vi-p 


Rearranging this equation, 


(11.14) 


_ -1 _ N-I 
ania ( 1—pN-'(DR)-N =) 


vp 


Differentiating this, the probability density function for the default rate is 


= -1 _ nel 2 
(DR) = 4 ==? exp 4 +) (NDR)? - =e) 
p 2 e 


(11.15) 


The procedure for calculating maximum likelihood estimates for PD and p from 
historical data is as follows: 


1. Choose trial values for PD and p. 

2. Calculate the logarithm of the probability density in equation (11.15) for each of the 
observations on DR. 

3. Use Solver to search for the values of PD and p that maximize the sum of the values 
in 2. 


One application of this is to the data in Table 11.6. The estimates for p and PD 
given by these data are 0.098 and 1.32%, respectively. (See worksheet on the author’s 
website for the calculations.) The probability distribution for the default rate is shown 
in Figure 11.6. The 99.9% worst case default rate is 


N! (0.0132) + y 0.098N7! (0.999) 
N| ——— e } = 0.093 


y1 -— 0.098 


or 9.3% per annum. 
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Figure 11.6 Probability Distribution of Default Rate When Parameters Are Estimated Using the 
Data in Table 11.6 


11.6.3 Alternatives to the Gaussian Copula 


The one-factor Gaussian copula model has its limitations. As Figure 11.4 illustrates, it 
leads to very little tail dependence. This means that an unusually early default for one 
company does not often happen at the same time as an unusually early default for another 
company. It can be difficult to find a p to fit data. For example, there is no p that is 
consistent with a PD of 1% and the situation where one year in 10 the default rate is 
greater than 3%. Other one-factor copula models with more tail dependence can provide 
a better fit to data. 

An approach to developing other one-factor copulas is to choose F or Z,, or both, 
as distributions with heavier tails than the normal distribution in equation (11.9). (They 
have to be scaled so that they have a mean of zero and standard deviation of one.) The 
distribution of U; is then determined (possibly numerically) from the distributions of F 
and Z;. Equation (11.10) becomes 


P-1(PD) + 4/pO7!(X) 
vi-p 


where ®, O, and ¥ are the cumulative probability distributions of Z;, F, and U,, and 
8 


WCDR(T, X) = ® 


equation (11.14) becomes 


1 — p®~!(DR) — Y~! (PD) 


vp 


8 This approach is applied to evaluating the risk of tranches created from mortgages in J. Hull and A. 
White, “The Risk of Tranches Created from Mortgages,” Financial Analysts Journal 66, no. 5 (Septem- 
ber/October 2010): 54-67. It provides a better fit to historical data in many situations. Its main dis- 
advantage is that the distributions used are not as easy to deal with as the normal distribution, and 


G(DR) = ® 


numerical analysis may be necessary to determine ¥ and g(DR). 
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Summary 


Risk managers use correlations or covariances to describe the relationship between two 
variables. The daily covariance rate is the correlation between the daily returns on the 
variables multiplied by the product of their daily volatilities. The methods for monitoring 
a covariance rate are similar to those described in Chapter 10 for monitoring a variance 
rate. Risk managers often try to keep track of a variance—covariance matrix for all the 
variables to which they are exposed. 

The marginal distribution of a variable is the unconditional distribution of the vari- 
able. Very often an analyst is in a situation where he or she has estimated the marginal 
distributions of a set of variables and wants to make an assumption about their corre- 
lation structure. If the marginal distributions of the variables happen to be normal, it 
is natural to assume that the variables have a multivariate normal distribution. In other 
situations, copulas are used. The marginal distributions are transformed on a percentile- 
to-percentile basis to normal distributions (or to some other distribution for which there 
is a multivariate counterpart). The correlation structure between the variables of interest 
is then defined indirectly from an assumed correlation structure between the transformed 
variables. 

When there are many variables, analysts often use a factor model. This is a way of 
reducing the number of correlation estimates that have to be made. The correlation 
between any two variables is assumed to derive solely from their correlations with the 
factors. The default correlation between different companies can be modeled using a 
factor-based Gaussian copula model of their times to default. 

An important application of copulas for risk managers is to the calculation of the 
distribution of default rates for loan portfolios. Analysts often assume that a one-factor 
copula model relates the probability distributions of the times to default for different 
loans. The percentiles of the distribution of the number of defaults on a large portfolio 
can then be calculated from the percentiles of the probability distribution of the factor. 
As we shall see in Chapter 15, this is the approach used in determining credit risk capital 
requirements for banks under Basel II. 
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Practice Questions and Problems 
(Answers at End of Book) 


If you know the correlation between two variables, what extra information do 
you need to calculate the covariance? 
What is the difference between correlation and dependence? Suppose that y = x? 
and x is normally distributed with mean zero and standard deviation one. What 
is the correlation between x and y? 
What is a factor model? Why are factor models useful when defining a correlation 
structure between large numbers of variables? 
What is meant by a positive-semidefinite matrix? What are the implications of a 
correlation matrix not being positive-semidefinite? 
Suppose that the current daily volatilities of asset A and asset B are 1.6% and 2.5%, 
respectively. The prices of the assets at close of trading yesterday were $20 and $40 
and the estimate of the coefficient of correlation between the returns on the two 
assets made at that time was 0.25. The parameter A used in the EWMA model is 
0.95. 
(a) Calculate the current estimate of the covariance between the assets. 
(b) On the assumption that the prices of the assets at close of trading today are 
$20.50 and $40.50, update the correlation estimate. 
Suppose that the current daily volatilities of asset X and asset Y are 1.0% and 1.2%, 
respectively. The prices of the assets at close of trading yesterday were $30 and $50 
and the estimate of the coefficient of correlation between the returns on the two 
assets made at this time was 0.50. Correlations and volatilities are updated using a 
GARCH(1,1) model. The estimates of the model’s parameters are « = 0.04 and 
B = 0.94. For the correlation œ = 0.000001 and for the volatilities œ = 0.000003. 
If the prices of the two assets at close of trading today are $31 and $51, how is the 
correlation estimate updated? 
Suppose that in Problem 10.15 the correlation between the S&P 500 index (mea- 
sured in dollars) and the FTSE 100 index (measured in sterling) is 0.7, the cor- 
relation between the S&P 500 index (measured in dollars) and the dollar-sterling 
exchange rate is 0.3, and the daily volatility of the S&P 500 index is 1.6%. What is 
the correlation between the S&P 500 index (measured in dollars) and the FTSE 
100 index when it is translated to dollars? (Hint: For three variables X, Y, and Z, 
the covariance between X + Y and Z equals the covariance between X and Z 
plus the covariance between Y and Z.) 
Suppose that two variables V, and V, have uniform distributions where all values 
between 0 and 1 are equally likely. Use a Gaussian copula to define the correlation 
structure between V} and V, with a copula correlation of 0.3. Produce a table 
similar to Table 11.5 considering values of 0.25, 0.50, and 0.75 for V, and V3. (A 
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11.9 


11.10 


11.11 


11.12 


11.13 


11.14 


11.15 


11.16 


11.17 


spreadsheet for calculating the cumulative bivariate normal distribution is on the 
author’s website: www-2.rotman.utoronto.ca/~hull/riskman.) 

Assume that you have independent random samples z4, z2, and z, from a standard 
normal distribution and want to convert them to samples €,, €,, and £} from a 
trivariate normal distribution using the Cholesky decomposition. Derive three 
formulas expressing £4, €5, and €, in terms of z4, Z2, and z, and the three corre- 
lations that are needed to define the trivariate normal distribution. 

Explain what is meant by tail dependence. How can you vary tail dependence by 
the choice of copula? 

Suppose that the marginal distributions of V} and V, are standard normal distribu- 
tions but that a Student’s t-copula with four degrees of freedom and a correlation 
parameter of 0.5 is used to define the correlation between the variables. How 
would you obtain samples from the joint distribution? 

In Table 11.5, what is the probability density function of V, conditional on V} < 
0.1? Compare it with the unconditional distribution of V}. 

What is the median of the distribution of V, when V, equals 0.2 in the example 
in Tables 11.3 and 11.4? 

Suppose that a bank has made a large number of loans of a certain type. The one- 
year probability of default on each loan is 1.5% and the recovery rate is 30%. The 
bank uses a Gaussian copula for time to default. Use Vasicek’s model to estimate 
the default rate that we are 99.5% certain will not be exceeded. Assume a copula 
correlation of 0.2. 

Suppose that the default rate for a portfolio of consumer loans over the past 
10 years has been 1%, 9%, 2%, 3%, 5%, 1%, 6%, 7%, 4%, and 1%. What are the 
maximum likelihood estimates of the parameters in Vasicek’s model? 


Further Questions 


Suppose that the price of Asset X at close of trading yesterday was $300 and its 
volatility was estimated as 1.3% per day. The price of X at the close of trading today 
is $298. Suppose further that the price of Asset Y at the close of trading yesterday 
was $8, its volatility was estimated as 1.5% per day, and its correlation with X was 
estimated as 0.8. The price of Y at the close of trading today is unchanged at $8. 
Update the volatility of X and Y and the correlation between X and Y using 
(a) The EWMA model with A = 0.94 
(b) The GARCH(1,1) model with œ = 0.000002, œ = 0.04, and B = 0.94. 

In practice, is the @ parameter likely to be the same for X and Y? 


-Ax where x is 


The probability density function for an exponential distribution is Ae 
the value of the variable and A is a parameter. The cumulative probability distribu- 
tion is 1 — e~**. Suppose that two variables V, and V, have exponential distribu- 
tions with A parameters of 1.0 and 2.0, respectively. Use a Gaussian copula to define 


the correlation structure between V} and V, with a copula correlation of —0.2. 
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11.18 


11.19 


11.20 


Produce a table similar to Table 11.5 using values of 0.25, 0.5, 0.75, 1,1.25,and 1.5 
for V, and V’,. (A spreadsheet for calculating the cumulative bivariate normal dis- 
tribution is on the author’s website: www-2.rotman.utoronto.ca/~hull/riskman.) 
Create an Excel spreadsheet to produce a chart similar to Figure 11.5 showing 
samples from a bivariate Student’s f-distribution with four degrees of freedom 
where the correlation is 0.5. Next suppose that the marginal distributions of V, 
and V, are Student’s t with four degrees of freedom but that a Gaussian cop- 
ula with a copula correlation parameter of 0.5 is used to define the correlation 
between the two variables. Construct a chart showing samples from the joint dis- 
tribution. Compare the two charts you have produced. 

Suppose that a bank has made a large number of loans of a certain type. The 
one-year probability of default on each loan is 1.2%. The bank uses a Gaussian 
copula for time to default. It is interested in estimating a 99.97% worst case for 
the percent of loans that default on the portfolio. Show how this varies with the 
copula correlation. 

The default rates in the past 15 years for a certain category of loans are 2%, 4%, 
7%, 12%, 6%, 5%, 8%, 14%, 10%, 2%, 3%, 2%, 6%, 7%, and 9%. Use the maximum 
likelihood method to calculate the best fit values of the parameters in Vasicek’s 
model. What is the probability distribution of the default rate? What is the 99.9% 
worst case default rate? 


Chapter 12 


Value at Risk and Expected 
Shortfall 


hapters 8 and 9 describe how a trader responsible for a financial institution’s 

exposure to a particular market variable (e.g.,an equity index, an interest rate, or 

a commodity price) quantifies and manages risks by calculating measures such 
as delta, gamma, and vega. Often a financial institution’s portfolio depends on hundreds, 
or even thousands, of market variables. Huge numbers of these types of risk measures 
are therefore produced each day. While very useful to traders, the risk measures do not 
provide senior management and the individuals who regulate financial institutions with 
an indication of the total risk to which the financial institution is exposed. 

Value at risk (VaR) and expected shortfall (ES) are attempts to provide a single num- 
ber that summarizes the total risk in a portfolio. VaR was pioneered by JPMorgan (see 
Business Snapshot 12.1) and is now widely used by corporate treasurers and fund man- 
agers as well as by financial institutions. As Chapters 15 and 16 show, it is the measure 
regulators have traditionally used for many of the calculations they carry out concerned 
with the setting of capital requirements for market risk, credit risk, and operational risk. 
As explained in Chapter 18, regulators are switching to ES for market risk. 
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BUSINESS SNAPSHOT 12.1 
Historical Perspectives on VaR 


JPMorgan is credited with helping to make VaR a widely used measure. The 
chairman, Dennis Weatherstone, was dissatisfied with the long risk reports he 
received every day. These contained a huge amount of detail on the Greek letters 
for different exposures, but very little that was really useful to top management. 
He asked for something simpler that focused on the bank’s total exposure over 
the next 24 hours measured across the bank’s entire trading portfolio. At first his 
subordinates said this was impossible, but eventually they adapted the Markowitz 
portfolio theory (see Section 1.1) to develop a VaR report. This became known 
as the 4:15 report because it was placed on the chairman’s desk at 4:15 P.M. every 
day after the close of trading. 

Producing the report entailed an enormous amount of work involving the 
collection of data daily on the positions held by the bank around the world, the 
handling of different time zones, the estimation of correlations and volatilities, 
and the development of computer systems. The work was completed in about 
1990. The main benefit of the new system was that senior management had a 
better understanding of the risks being taken by the bank and were better able 
to allocate capital within the bank. Other banks had been working on similar 
approaches for aggregating risks and by 1993 VaR was established as an important 
risk measure. 

Banks usually keep the details about the models they develop internally a 
secret. However, in 1994 JPMorgan made a simplified version of its own system, 
which they called RiskMetrics, available on the Internet. RiskMetrics included 
variances and covariances for a very large number of different market variables. 
This attracted a lot of attention and led to debates about the pros and cons of dif- 
ferent VaR models. Software firms started offering their own VaR models, some 
of which used the RiskMetrics database. After that, VaR was rapidly adopted as 
a standard by financial institutions and some nonfinancial corporations. The BIS 
Amendment, which was based on VaR (see Section 15.6), was announced in 
1996 and implemented in 1998. Later the RiskMetrics group within JPMorgan 
was spun off as a separate company. This company developed CreditMetrics for 
handling credit risks in 1997 and CorporateMetrics for handling the risks faced 
by non-financial corporations in 1999. 


This chapter introduces the VaR and ES measures and discusses their strengths and 
weaknesses. Chapters 13 and 14 discuss how they are calculated for market risk while 
Chapter 21 considers the calculation of VaR for credit risk. 
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(100 — X)% 


Loss -V Gain 


Figure 12.1 Calculation of VaR from the Probability Distribution of the Gain in the 
Portfolio Value 
Losses are negative gains; confidence level is X%; VaR level is V. 


12.1 Definition of VaR 


When using the value at risk measure, we are interested in making a statement of the 
following form: 


“We are X percent certain that we will not lose more than V dollars in time T?” 


The variable V is the VaR of the portfolio. It is a function of two parameters: the time 
horizon, T, and the confidence level, X percent. It is the loss level during a time period 
of length T that we are X% certain will not be exceeded. 

VaR can be calculated from either the probability distribution of gains during time 
T or the probability distribution of losses during time T. (In the former case, losses are 
negative gains;in the latter case, gains are negative losses.) For example, when Tis five days 
and X = 97, VaR is the loss at the 3rd percentile of the distribution of gains over the 
next five days. Alternatively, it is the loss at the 97th percentile of the distribution of losses 
over the next five days. More generally, when the distribution of gains is used, VaR is 
equal to minus the gain at the (100 — X)th percentile of the distribution, as illustrated in 
Figure 12.1. When the distribution of losses is used, VaR is equal to the loss at the Xth 
percentile of the distribution, as indicated in Figure 12.2. 


(100 —X)% 


Gain V Loss 


Figure 12.2 Calculation of VaR from the Probability Distribution of the Loss in the 
Portfolio Value 
Gains are negative losses; confidence level is X%; VaR level is V. 
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12.2 Examples of the Calculation of VaR 


This section provides four simple examples to illustrate the calculation of VaR. In the first 
two examples, the probability distribution of the gain (or loss) is a continuous distribution. 
In the last two examples, it is a discrete distribution. 


Example 12.1 


Suppose that the gain from a portfolio during six months is normally distributed with 
a mean of $2 million and a standard deviation of $10 million. From the properties of 
the normal distribution, the one-percentile point of this distribution is 2 — 2.326 x 10 
or —$21.3 million. The VaR for the portfolio with a time horizon of six months and 
confidence level of 99% is therefore $21.3 million. 


Example 12.2 


Suppose that for a one-year project all outcomes between a loss of $50 million and a 
gain of $50 million are considered equally likely. In this case, the loss from the project 
has a uniform distribution extending from —$50 million to +$50 million. There is a 1% 
chance that there will be a loss greater than $49 million. The VaR with a one-year time 
horizon and a 99% confidence level is therefore $49 million. 


Example 12.3 


A one-year project has a 98% chance of leading to a gain of $2 million, a 1.5% chance 
of leading to a loss of $4 million, and a 0.5% chance of leading to a loss of $10 million. 
The cumulative loss distribution is shown in Figure 12.3. The point on this cumulative 
distribution that corresponds to a cumulative probability of 99% is $4 million. It follows 
that VaR with a confidence level of 99% and a one-year time horizon is $4 million. 


/ Cumulative 
probability 
0.99 - 
0.98 
0.97 + 
0.96 - 
Loss ($ million) 
0.95 T T T T 1 
-2 0 2 4 6 8 10 


Figure 12.3 Cumulative Loss Distribution for Examples 12.3 and 12.4 


Value at Risk and Expected Shortfall 273 


Example 12.4 


Consider again the situation in Example 12.3. Suppose that we are interested in calculat- 
ing a VaR using a confidence level of 99.5%. In this case, Figure 12.3 shows that all losses 
between $4 million and $10 million have a probability of 99.5% of not being exceeded. 
Equivalently, there is a probability of 0.5% of any specified loss level between $4 million 
and $10 million being exceeded. VaR is therefore not uniquely defined. One reasonable 
convention in this type of situation is to set VaR equal to the midpoint of the range of 
possible VaR values. This means that, in this case, VaR would equal $7 million. 


12.3 A Drawback of VaR 


VaR is an attractive measure because it is easy to understand. In essence, it asks the 
simple question “How bad can things get?” This is the question all senior managers want 
answered. They are very comfortable with the idea of compressing all the Greek letters 
for all the market variables underlying a portfolio into a single number. 

However, when VaR is used in an attempt to limit the risks taken by a trader, it 
can lead to undesirable results. Suppose that a bank tells a trader that the one-day 99% 
VaR of the trader’s portfolio must be limited to $10 million. The trader can construct a 
portfolio where there is a 99.1% chance that the daily loss is less than $10 million and a 
0.9% chance that it is $500 million. The trader is satisfying the risk limits imposed by the 
bank but is clearly taking unacceptable risks. The sort of probability distribution of gains 
that the trader might aim for is shown in Figure 12.4. The VaR in Figure 12.4 might be 
the same as the VaR in Figure 12.1. But the portfolio in Figure 12.4 is much riskier than 
the portfolio in Figure 12.1 because a large loss is more likely. 

It might be thought that a probability distribution such as that in Figure 12.4 would 
never occur in practice. In fact it is not that unusual. Many trading strategies give a 
high probability of good returns and a small probability of a huge loss. (For example, 
writing out-of-the-money options is a strategy where most of the time the trader collects 
the option premium and does not have to provide a payoff to the option buyer. But 


(100 - X)% 


Loss -V Gain 


Figure 12.4 Probability Distribution for Gain in Portfolio Value during Time T 
Confidence Level is X%. Portfolio has the same VaR level, V, as in Figure 12.1, but a large loss is 
more likely. 
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occasionally the option is exercised in circumstances where the trader takes a big loss.) 
Many traders like taking high risks in the hope of realizing high returns. If they can find 
ways of taking high risks without violating risk limits, they will do so. To quote one 
trader the author has talked to: “I have never met a risk control system that I cannot 
trade around.” 


12.4 Expected Shortfall 


A measure that can produce better incentives for traders than VaR is expected shortfall (ES). 
This is also sometimes referred to as conditional value at risk, conditional tail expectation, or 
expected tail loss. Whereas VaR asks the question: “How bad can things get?” ES asks: “If 
things do get bad, what is the expected loss?” ES, like VaR, is a function of two parameters: 
T (the time horizon) and X (the confidence level). Indeed, in order to calculate ES it is 
necessary to calculate VaR first. ES is the expected loss during time T conditional on the 
loss being greater than the VaR. For example, suppose that X = 99, T is 10 days, and the 
VaR is $64 million. The ES is the average amount lost over a 10-day period assuming 
that the loss is greater than $64 million. 

Setting an ES limit rather than a VaR limit for traders makes it less likely that they 
will be able to take the sort of position indicated by Figure 12.4. Also, as shown in the 
next section, ES has better properties than VaR in that it always recognizes the benefits 
of diversification. One disadvantage is that it does not have the simplicity of VaR and as 
a result is more difficult to understand. Another is that it is more difficult to back-test 
a procedure for calculating ES than it is to back-test a procedure for calculating VaR. 
(Back-testing, as will be explained later, is a way of looking at historical data to test the 
reliability of a particular methodology for calculating a risk measure.) 


12.5 Coherent Risk Measures 


Suppose that the VaR of a portfolio for a confidence level of 99.9% and a time horizon 
of one year is $50 million. This means that in extreme circumstances (theoretically, once 
every thousand years) the financial institution will lose more than $50 million in a year. 
It also means that if it keeps $50 million in capital it will have a 99.9% probability of not 
running out of capital in the course of one year. 

Suppose we are trying to design a risk measure that will equal the capital a financial 
institution is required to keep. Is VaR (with an appropriate time horizon and an appro- 
priate confidence level) the best measure? Artzner et al. have examined this question. 
They first proposed a number of properties that such a risk measure should have. 
These are: 


'See P Artzner, E Delbaen, J.-M. Eber, and D. Heath, “Coherent Measures of Risk,” Mathematical 
Finance 9 (1999): 203-228. 
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1. Monotonicity: If a portfolio produces a worse result than another portfolio for every 
state of the world, its risk measure should be greater. 

2. Translation Invariance: If an amount of cash K is added to a portfolio, its risk measure 
should go down by K. 

3. Homogeneity: Changing the size of a portfolio by a factor A while keeping the relative 
amounts of different items in the portfolio the same should result in the risk measure 
being multiplied by À. 

4. Subadditivity: The risk measure for two portfolios after they have been merged should 
be no greater than the sum of their risk measures before they were merged. 


The first condition is straightforward. If one portfolio always performs worse than 
another portfolio, it clearly should be viewed as more risky and require more capital. The 
second condition is also reasonable. If we add an amount of cash equal to K to a portfolio, 
the cash provides a buffer against losses and should reduce the capital requirement by K. 
The third condition is also reasonable. If we double the size of a portfolio, presumably 
we should require twice as much capital? The fourth condition states that diversification 
helps reduce risks. When we aggregate two portfolios, the total risk measure should either 
decrease or stay the same. 

VaR satisfies the first three conditions. However, it does not always satisfy the fourth 
one, as is illustrated by the following two examples. 


Example 12.5 


Suppose each of two independent projects has a probability of 0.02 ofa loss of $10 million 
and a probability of 0.98 of a loss of $1 million during a one-year period. The one-year, 
97.5% VaR for each project is $1 million. When the projects are put in the same portfolio, 
there is a 0.02 x 0.02 = 0.0004 probability of a loss of $20 million, a 2 x 0.02 x 0.98 = 
0.0392 probability of a loss of $11 million, and a 0.98 Xx 0.98 = 0.9604 probability of a 
loss of $2 million. The one-year 97.5% VaR for the portfolio is $11 million. The total 
of the VaRs of the projects considered separately is $2 million. The VaR of the portfolio 
is therefore greater than the sum of the VaRs of the projects by $9 million. This violates 
the subadditivity condition. 


Example 12.6 


A bank has two $10 million one-year loans. The probabilities of default are as indicated 
in the following table. 


? This is true provided the portfolio is not too large. As a portfolio’s size increases, it becomes less 
liquid and proportionally more capital may be required. 
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Outcome Probability 
Neither loan defaults 97.50% 
Loan 1 defaults; Loan 2 does not default 1.25% 
Loan 2 defaults; Loan 1 does not default 1.25% 
Both loans default 0.00% 


If a default occurs, all losses between 0% and 100% of the principal are equally likely. If 
the loan does not default, a profit of $0.2 million is made. 

Consider first Loan 1. This has a 1.25% chance of defaulting. When a default occurs 
the loss experienced is evenly distributed between zero and $10 million. This means 
that there is a 1.25% chance that a loss greater than zero will be incurred; there is a 
0.625% chance that a loss greater than $5 million is incurred; there is no chance of a loss 
greater than $10 million. The loss level that has a probability of 1% of being exceeded 
is $2 million. (Conditional on a loss being made, there is an 80% or 0.8 chance that the 
loss will be greater than $2 million. Because the probability of a loss is 1.25% or 0.0125, 
the unconditional probability of a loss greater than $2 million is 0.8 X 0.0125 = 0.01 or 
1%.) The one-year 99% VaR is therefore $2 million. The same applies to Loan 2. 

Consider next a portfolio of the two loans. There is a 2.5% probability that a default 
will occur. As before, the loss experienced on a defaulting loan is evenly distributed 
between zero and $10 million. The VaR in this case turns out to be $5.8 million. This 
is because there is a 2.5% (0.025) chance of one of the loans defaulting and conditional 
on this event is a 40% (0.4) chance that the loss on the loan that defaults is greater than 
$6 million. The unconditional probability of a loss from a default being greater than 
$6 million is therefore 0.4 x 0.025 = 0.01 or 1%. In the event that one loan defaults, a 
profit of $0.2 million is made on the other loan, showing that the one-year 99% VaR is 
$5.8 million. 

The total VaR of the loans considered separately is 2 + 2 = $4 million. The total VaR 
after they have been combined in the portfolio is $1.8 million greater at $5.8 million. 
This shows that the subadditivity condition is violated. (This is in spite of the fact that 
there are clearly very attractive diversification benefits from combining the loans into a 
single portfolio—particularly because they cannot default together.) 


Risk measures satisfying all four conditions given above are referred to as coherent. 
Examples 12.5 and 12.6 illustrate that VaR is not coherent. It can be shown that the ES 
measure is always coherent. The following examples illustrate this. 


Example 12.7 


Consider again the situation in Example 12.5. The VaR for one of the projects considered 
on its own is $1 million. To calculate the ES for a 97.5% confidence level we note that, 
of the 2.5% tail of the loss distribution, 2% corresponds to a $10 million loss and 0.5% 
to a $1 million loss. (Note that the other 97.5% of the distribution also corresponds to a 
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loss of $1 million.) Conditional that we are in the 2.5% tail of the loss distribution, there 
is therefore an 80% probability of a loss of $10 million and a 20% probability of a loss of 
$1 million. The expected loss is 0.8 X 10 + 0.2 X 1 or $8.2 million. 

When the two projects are combined, of the 2.5% tail of the loss distribution, 0.04% 
corresponds to a loss of $20 million and 2.46% corresponds to a loss of $11 million. 
Conditional that we are in the 2.5% tail of the loss distribution, the expected loss is 
therefore (0.04/2.5) x 20 + (2.46/2.5) X 11 or $11.144 million. This is the ES. 

Because 8.2 + 8.2 > 11.144, the ES measure does satisfy the subadditivity condition 
for this example. 


Example 12.8 


Consider again the situation in Example 12.6. We showed that the VaR for a single loan 
is $2 million. The ES from a single loan when the time horizon is one year and the 
confidence level is 99% is therefore the expected loss on the loan conditional on a loss 
greater than $2 million. Given that losses are uniformly distributed between zero and 
$10 million, the expected loss conditional on a loss greater than $2 million is halfway 
between $2 million and $10 million, or $6 million. 

The VaR for a portfolio consisting of the two loans was calculated in Example 12.6 
as $5.8 million. The ES from the portfolio is therefore the expected loss on the portfolio 
conditional on the loss being greater than $5.8 million. When one loan defaults, the 
other (by assumption) does not and outcomes are uniformly distributed between a gain 
of $0.2 million and a loss of $9.8 million. The expected loss, given that we are in the 
part of the distribution between $5.8 million and $9.8 million, is $7.8 million. This is 
therefore the ES of the portfolio. 

Because $7.8 million is less than 2 x $6 million, the ES measure does satisfy the 
subadditivity condition. 


The subadditivity condition is not of purely theoretical interest. Occasionally a bank 
finds that, when it combines two portfolios (e.g., its equity portfolio and its fixed-income 
portfolio), the total VaR goes up. 


12.5.1 Exponential Spectral Risk Measures 


A risk measure can be characterized by the weights it assigns to percentiles of the loss 
distribution? VaR gives a 100% weighting to the Xth percentiles and zero to other 
percentiles. ES gives equal weight to all percentiles greater than the Xth percentile and 
zero weight to all percentiles below the Xth percentile. We can define other risk measures 
by making other assumptions about the weights assigned to percentiles. A general result is 


3 Percentiles are also referred to as quantiles or fractiles. 
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Expected shortfall 
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Figure 12.5 Weights as a Function of Percentiles 
(a) Expected shortfall when X = 90%, (b) exponential spectral risk measure with y = 0.15, and (c) 
exponential spectral risk measure with y = 0.05 


that a risk measure is coherent (1.e., it satisfies the subadditivity condition) if the weight 
assigned to the qth percentile of the loss distribution is a nondecreasing function of 
q. ES satisfies this condition. However, VaR does not, because the weights assigned to 
percentiles greater than X are less than the weight assigned to the Xth percentile. Some 
researchers have proposed measures where the weights assigned to the qth percentile 
of the loss distribution increase relatively fast with q. One idea is to make the weight 
assigned to the qth percentile proportional to e~'-9/Y where y is a constant. This is 
referred to as the exponential spectral risk measure. Figure 12.5 shows the weights assigned 
to loss percentiles for ES and for the exponential spectral risk measure when y has two 
different values. 


12.6 Choice of Parameters for VaR and ES 


For VaR and ES, a user must choose two parameters: the time horizon and the confidence 
level. A simple assumption is that the change in the portfolio value at the time horizon is 
normally distributed. As explained in Section 10.3, this is not usually a good assumption. 
However, it is useful for us to consider the consequences of the assumption at this stage. 
When the loss in the portfolio value has a mean of p and a standard deviation of 6, 


VaR = p + 6N7!(X) (12.1) 


where X is the confidence level and N7'(.) is the inverse cumulative normal distribu- 
tion (which can be calculated using NORMSINV in Excel). For relatively short time 
horizons, p is often assumed to be zero. VaR for a particular confidence level is then 
proportional to ©. 
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Example 12.9 


Suppose that the loss from a portfolio over a 10-day time horizon is normal with a mean 
of zero and a standard deviation of $20 million. The 10-day 99% VaR is 


20N7!(0.99) = 46.5 


or $46.5 million. 


When the loss is assumed to be normally distributed with mean p and standard 
deviation ©, ES with a confidence level of X is given by 


FYR 
ES = p +o — (12.2) 


V2n(1 — X) 


where Y is the Xth percentile point of the standard normal distribution (i.e., it is the 
point on a normal distribution with mean zero and standard deviation one that has a 
probability 1 — X of being exceeded). This shows that when p is assumed to be zero, ES, 
like VaR, is proportional to o. 


Example 12.10 


Consider again the situation in Example 12.9 where the loss from a portfolio over a 10- 
day time horizon is normally distributed with a mean of zero and a standard deviation 
of $20 million. Because 2.326 is the point on a standard normal distribution that has a 
1% chance of being exceeded, the 10-day 99% ES is 


72-3267 /2 
20———— = 53.3 


/2n x 0.01 


or $53.3 million. 


12.6.1 The Time Horizon 


An appropriate choice for the time horizon, when VaR or ES is calculated, depends on 
the application. When positions are very liquid and actively traded, it makes sense to use 
a short time horizon (perhaps only a few days). If the measure calculated turns out to 
be unacceptable, the portfolio can be adjusted fairly quickly. Also, a longer time horizon 
might not be meaningful because of changes in the composition of the portfolio. 
When VaR or ES is being calculated by the manager of a pension fund, a longer 
time horizon 1s likely to be used. This is because the portfolio is traded less actively and 
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some of the instruments in the portfolio are less liquid. When the liquidity of a portfolio 
varies from one instrument to another, the definition of VaR or ES can be changed 
so that the changes considered vary from one market variable to another. Consider, for 
example, a portfolio consisting of shares in IBM and a corporate bond that trades fewer 
than 10 times per year. It could make sense to calculate a risk measure from the change 
in the price of IBM that we are 99% confident will not be exceeded over 10 days and 
the change in the bond price that we are 99% certain will not be exceeded over 60 days. 
This is the approach used by regulators in the Fundamental Review of the Trading Book, 
which is discussed in Chapter 18. 

Whatever the application, when market risks are being considered, analysts often start 
by calculating VaR or ES for a time horizon of one day. The usual assumption is 


T-day VaR = 1-day VaR x VT (12.3) 
T-day ES = 1-day ES x VT (12.4) 


These formulas are exactly true when the changes in the value of the portfolio on suc- 
cessive days have independent identical normal distributions with mean zero. In other 
cases, they are approximations. The formulas follow from equations (12.1) and (12.2) and 
the following results. 


1. The standard deviation of the sum on T independent identical distributions is y T 
times the standard deviation of each distribution. 
2. The sum of the independent normal distributions is normal. 


12.6.2 Impact of Autocorrelation 


In practice, the changes in the value of a portfolio from one day to the next are not 
always totally independent. Define AP, as the change in the value of a portfolio on day 
i. A simple assumption is first-order autocorrelation where the correlation between AP, 
and AP, is p for all i. Suppose that the variance of AP, is o° for all i. Using the usual 
formula for the variance of the sum of two variables, the variance of AP,_, + AP, is 


o? +07 + 2p0° = 2(1 + p)o” 


The correlation between AP,_; and AP, is pi. Extending the analysis leads to the following 
formula for the standard deviation of er AP, (see Problem. 12.11): 


ov T + 2(T —1)p + 2(T — 2)p? + 2(T — 3)p> +... 2p7-! (12.5) 


Table 12.1 shows the impact of autocorrelation on the ratio of the T-day VaR (ES) to the 
one-day VaR (ES). It assumes that the distributions of daily changes in the portfolio are 
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Table 12.1 Ratio of T-Day VaR (ES) to One-Day VaR (ES) for Different Values of T When 
There Is First-Order Correlation and Daily Changes Have Identical Normal Distributions with 
Mean Zero. 


T=1 T=2 T=5 T= 10 T=50 T = 250 
p=0 1.00 1.41 2.24 3.16 7.07 15.81 
p = 0.05 1.00 1.45 2.33 3.31 7.43 16.62 
p=0.1 1.00 1.48 2.42 3.46 7.80 17.47 
p = 0.2 1.00 1.55 2.62 3.79 8.62 19.35 


identical normals with mean zero. Note that the ratio of the T-day VaR (ES) to the one- 
day VaR (ES) does not depend on the daily standard deviation, ©, or the confidence level. 
This follows from the results in equations (12.1) and (12.2) and the property of equation 
(12.5) that the T-day standard deviation 1s proportional to the one-day standard deviation. 
Comparing the p = 0 row in Table 12.1 with the other rows shows that the existence 
of autocorrelation results in the VaR and ES estimates calculated from equations (12.3) 
and (12.4) being too low. 


Example 12.11 


Suppose that daily changes in a portfolio value are normally distributed with mean zero 
and standard deviation $3 million. The first-order autocorrelation of daily changes is 0.1. 
From equation (12.5), the standard deviation of the change in the portfolio value over 
five days is 


3V¥5+2x4x014+2x*3x0124+2x2x01°4+2x1x0.14 = 7.265 


The five-day 95% VaR is therefore 7.265 x N7!(0.95) = 11.95 or $11.95 million. The 
five-day ES is 


e71.6457/2 
7.265 x —————_ = 14.98 


V27 x 0.05 


Note that the ratio of the five-day standard deviation of portfolio changes to the one-day 
standard deviation is 7.265/3 = 2.42. This is the number in Table 12.1 for p = 0.1 and 
T =5; 


12.6.3 Confidence Level 


The confidence level chosen for VaR or ES is likely to depend on a number of factors. 
Suppose that a bank wants to maintain an AA credit rating and calculates that companies 
with this credit rating have a 0.03% chance of defaulting over a one-year period. It might 
choose to use a 99.97% confidence level in conjunction with a one-year time horizon 
when calculating VaR for internal risk management purposes. Suppose, for example, that 
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the one-year 99.97% VaR across all exposures is $5 billion. This means that with $5 billion 
of capital the bank will have a 0.03% chance of becoming insolvent (i.e., running out 
of equity) during one year. This type of analysis might be communicated by banks to 
rating agencies in an attempt to convince the rating agency that the bank deserves its AA 
rating. 

The confidence level that is actually used for the first VaR or ES calculation is some- 
times much less than the one that is required. This is because it is very difficult to estimate 
a VaR directly when the confidence level is very high. A general approach for increas- 
ing the confidence level is extreme value theory, discussed in the next chapter. If daily 
portfolio changes are assumed to be normally distributed with zero mean, we can use 
equations (12.1) and (12.2) to convert a VaR or ES calculated with one confidence 
level to that with another confidence level. For example, suppose that © is the standard 
deviation of the change in the portfolio value over a certain time horizon and that the 
expected change in the portfolio value is zero. Denote VaR and ES for a confidence level 
of X by VaR(X) and ES(X), respectively. From equation (12.1) 


VaR (X) = oN“! (X) 


for all confidence levels X. It follows that a VaR with a confidence level of X* can be 
calculated from a VaR with a lower confidence level of X using 


‘ NX) 
VaR (X") = VaR(X)——— (12.6) 
N=1(X) 
Similarly, from equation (12.2) 
(1 = Her US ye 
ES(X*) = ES(X)- (12.7) 


1= X" 


where Y and Y* are the points on the standard normal distribution that have probabilities 
1 — X and 1 — X* of being exceeded. 

Equations (12.6) and (12.7) assume that the two VaR/ES measures have the same 
time horizon. If we want to change the time horizon and the confidence level, we can 
use the equations in conjunction with equation (12.3) or (12.4). 


Example 12.12 


Suppose that the one-day VaR with a confidence level of 95% is $1.5 million and the 
one-day expected shortfall is $2 million. Using the assumption that the distribution of 
changes in the portfolio value is normal with mean zero, equation (12.6) gives the one- 
day 99% VaR as 


1.5 X mas = 2.12 
1.645 
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or $2.12 million. Equation (12.7) gives the one-day 99% ES as 


0.05 —(2.326-1.645)x(2.326+1.645)/2 — 2.58 
0.01 


2X 


or $2.58 million. 


12.7 Marginal, Incremental, and Component Measures 


Consider a portfolio that is composed of a number of subportfolios. The subportfolios 
could correspond to asset classes (e.g., domestic equities, foreign equities, fixed income, 
and derivatives). They could correspond to the different business units (e.g, retail banking, 
investment banking, and proprietary trading). They could even correspond to individual 
trades. Analysts sometimes calculate measures of the contribution of each subportfolio 
to VaR or ES. 

Suppose that the amount invested in ith subportfolio is x;. The marginal value at 
risk for the ith subportfolio is the sensitivity of VaR to the amount invested in the ith 
subportfolio. It is 


VaR 
Ox, 


1 


To estimate marginal VaR, we can increase x; to x; + Ax; for a small Ax; and recalculate 
VaR. If AVaR is the increase in VaR, the estimate of marginal VaR is AVaR /Ax,. For a 
well-diversified investment portfolio, marginal VaR is closely related to the capital asset 
pricing model’s beta (see Section 1.3). If an asset’s beta is high, its marginal VaR will 
tend to be high. If its beta is low, the marginal VaR tends to be low. In some circum- 
stances, marginal VaR is negative indicating that increasing the weighting of a particular 
subportfolio reduces the risk of the portfolio. 

The incremental value at risk for the ith subportfolio is the incremental effect of the ith 
subportfolio on VaR. It is the difference between VaR with the subportfolio and VaR 
without the subportfolio. Traders are often interested in the incremental VaR for a new 
trade. 

The component value at risk for the ith subportfolio is 


6a (12.8) 


This can be approximated as 
AVaR 
Se 
Ax; ' 


1 


It can be calculated by making a small percentage change y; = Ax;/x; in the amount 
invested in the ith subportfolio and recalculating VaR. If AVaR is the increase in VaR, 
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the estimate of component VaR is AVaR/y;. In many situations, component VaR is a 
reasonable approximation to incremental VaR. This is because, if a subportfolio is small 
in relation to the size of the whole portfolio, it can be assumed that the marginal VaR 
remains constant as x; is reduced all the way to zero. When this assumption is made, the 
impact of reducing x; to zero is x; times the marginal VaR—which is the component VaR. 

Marginal ES, incremental ES, and component ES can be defined similarly to marginal 
VaR, incremental VaR, and component VaR, respectively. 


12.8 Euler’s Theorem 


A result produced by the great mathematician Leonhard Euler many years ago turns 
out to be very important when a risk measure for a whole portfolio is allocated to 
subportfolios. Suppose that V is a risk measure for a portfolio and x; is a measure of the 
size of the ith subportfolio (1 < i < M). Assume that, when x; is changed to Ax; for all x; 
(so that the size of the portfolio is multiplied by A), V changes to AV. This corresponds 
to the third condition in Section 12.5 and is known as linear homogeneity. It is true for 
most risk measures.* 


Euler’s theorem shows that it is then true that 
y= — x, (12.9) 


This result provides a way of allocating V to the subportfolios. 
When the risk measure is VaR, Euler’s theorem gives 


M 
VaR =)! G 
i=1 


where C;, as in equation (12.8), is the component VaR for the ith subportfolio. This 
shows that the total VaR for a portfolio is the sum of the component VaRs for the 
subportfolios. Component VaRs are therefore a convenient way of allocating a total VaR 
to subportfolios. As explained in the previous section, component VaRs also have the 
attractive property that the ith component VaR for a large portfolio is approximately 
equal to the incremental VaR for that component. 

When the risk measure is ES, Euler’s theorem similarly shows that the total ES is the 
sum of the component ESs: 


* An exception could be a risk measure that incorporates liquidity. As a portfolio becomes larger, its 
liquidity declines. 
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ES can therefore be allocated to the component parts of a business similarly to VaR. In 
Chapter 26, we will show how Euler’s theorem is used to allocate a bank’s economic 
capital to its business units. 

Euler’s theorem allows risk to be decomposed into its components. It is a useful 
tool in determining risk in what is referred to as risk budgeting. This is concerned with 
the amount of risk that should be allocated to different components of a portfolio. If 
Euler’s decomposition shows that an unacceptable percentage of the risk is attributable 
to a particular component, the portfolio can be rebalanced. 


12.9 Aggregating VaRs and ESs 


Sometimes a business has calculated VaRs, with the same confidence level and time 
horizon, for several different segments of its operations and is interested in aggregating 
them to calculate a total VaR. A formula for doing this is 


DD VaR VaR p; 
J 


where VaR; is the VaR for the ith segment, VaR is the total VaR, and Pij is the 
correlation between losses from segment i and segment j. This is exactly true when the 


VaR (12.10) 


total 


total 


losses (gains) have zero-mean normal distributions and provides a good approximation in 
many other situations. The same is true when VaR is replaced by ES in equation (12.10). 


Example 12.13 


Suppose the ESs calculated for two segments of a business are $60 million and $100 mil- 
lion. The correlation between the losses is estimated as 0.4. An estimate of the total 


ES is 
y 602 + 1002 + 2 x 60 x 100 x 0.4 = 135.6 


12.10 Back-Testing 


Back-testing is an important reality check for a risk measure. It is a test of how well the 
current procedure for calculating the measure would have worked in the past. VaR is 
easier to back-test than ES. No doubt this is one of the reasons why regulators have in 
the past been reluctant to switch from VaR to ES for market risk. As we will explain 
in Chapter 18, their future plans involve using ES to determine regulatory capital, but 
back-testing using VaR estimates. 

Suppose that we have developed a procedure for calculating a one-day 99% VaR. 
Back-testing involves looking at how often the loss in a day would have exceeded the 
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one-day 99% VaR when the latter is calculated using the current procedure. Days when 
the actual loss exceeds VaR are referred to as exceptions. If exceptions happen on about 
1% of the days, we can feel reasonably comfortable with the current methodology for 
calculating VaR. If they happen on, say, 7% of days, the methodology is suspect and it is 
likely that VaR is underestimated. From a regulatory perspective, the capital calculated 
using the current VaR estimation procedure is then too low. On the other hand, if excep- 
tions happen on, say, 0.3% of days, it is likely that the current procedure is overestimating 
VaR and the capital calculated is too high. 

One issue in back-testing a one-day VaR is whether we should consider changes 
made in the portfolio during a day. There are two possibilities. The first is to compare 
VaR with the hypothetical change in the portfolio value calculated on the assumption 
that the composition of the portfolio remains unchanged during the day. The other is 
to compare VaR to the actual change in the value of the portfolio during the day. The 
assumption underlying the calculation of VaR is that the portfolio will remain unchanged 
during the day and so the first comparison based on hypothetical changes is more theo- 
retically correct. However, it is actual changes in the portfolio value that we are ultimately 
interested in. In practice, risk managers usually compare VaR to both hypothetical port- 
folio changes and actual portfolio changes (and regulators usually insist on seeing the 
results of back-testing using both types of changes). The actual changes are adjusted for 
items unrelated to the market risk, such as fee income and profits from trades carried out 
at prices different from the mid-market. 

Suppose that the confidence level for a one-day VaR is X%. If the VaR model used 
is accurate, the probability of the VaR being exceeded on any given day is p= 1 — 
X/100. Suppose that we look at a total of n days and we observe that the VaR level 
is exceeded on m of the days where m/n > p. Should we reject the model for producing 
values of VaR that are too low? Expressed formally, we can consider two alternative 
hypotheses: 


1. The probability of an exception on any given day is p. 
2. The probability of an exception on any given day is greater than p. 


From the properties of the binomial distribution, the probability of the VaR level 
being exceeded on m or more days is 


n 


n! k n—k 
ee i EE 
rea p) 


k=m 


This can be calculated using the BINOMDIST function in Excel. An often-used signif- 
icance level in statistical tests is 5%. If the probability of the VaR level being exceeded 
on m or more days is less than 5%, we reject the first hypothesis that the probability of 
an exception is p. If the probability of the VaR level being exceeded on m or more days 
is greater than 5%, the hypothesis is not rejected. 
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Example 12.14 


Suppose that we back-test a VaR model using 600 days of data. The VaR confidence level 
is 99% and we observe nine exceptions. The expected number of exceptions is six. Should 
we reject the model? The probability of nine or more exceptions can be calculated in 
Excel as 1— BINOMDIST(8,600,0.01, TRUE). It is 0.152. At a 5% significance level we 
should not therefore reject the model. However, if the number of exceptions had been 12 
we would have calculated the probability of 12 or more exceptions as 0.019 and rejected 
the model. The model is rejected when the number of exceptions is 11 or more. (The 
probability of 10 or more exceptions is greater than 5%, but the probability of 11 or 
more is less than 5%.) 


When the number of exceptions, m, is lower than the expected number of exceptions, 
we can similarly test whether the true probability of an exception is 1%. (In this case, our 
alternative hypothesis is that the true probability of an exception is less than 1%.) The 
probability of m or fewer exceptions is 


m 


n! k n-k 
te 
Daa p) 


k=0 i 


and this is compared with the 5% threshold. 


Example 12.15 


Suppose again that we back-test a VaR model using 600 days of data when the VaR 
confidence level is 99% and we observe one exception, well below the expected number 
of six. Should we reject the model? The probability of one or zero exceptions can be 
calculated in Excel as BINOMDIST(1,600,0.01, TRUE). It is 0.017. At a 5% significance 
level, we should therefore reject the model. However, if the number of exceptions had 
been two or more, we would not have rejected the model. 


The tests we have considered so far have been one-tailed tests. In Example 12.14, we 
assumed that the true probability of an exception was either 1% or greater than 1%. In 
Example 12.15, we assumed that it was 1% or less than 1%. Kupiec (1995) has proposed 
a relatively powerful two-tailed test If the probability of an exception under the VaR 
model is p and m exceptions are observed in n trials, then 


—2ln[(1 — p)” "p"] + 21ln[(1 — m/n)" (m/n)"] (12.11) 


>See P. Kupiec, “Techniques for Verifying the Accuracy of Risk Management Models,” Journal of 
Derivatives 3 (1995): 73—84. 
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should have a chi-square distribution with one degree of freedom. Values of the statistic 
are high for either very low or very high numbers of exceptions. There is a probability 
of 5% that the value of a chi-square variable with one degree of freedom will be greater 
than 3.84. It follows that we should reject the model whenever the expression in equation 
(12.11) is greater than 3.84. 


Example 12.16 


Suppose that, as in the previous two examples, we back-test a VaR model using 600 days 
of data when the VaR confidence level is 99%. The value of the statistic in equation 
(12.11) is greater than 3.84 when the number of exceptions, m, is one or less and when 
the number of exceptions is 12 or more. We therefore accept the VaR model when 
2<m< 11 and reject it otherwise. 


Generally speaking, the difficulty of back-testing a VaR model increases as the VaR 
confidence level increases. This is an argument in favor of using relatively low confi- 
dence levels for VaR for back-testing purposes and then using extreme value theory (see 
Chapter 13) to obtain the required confidence level. 


12.10.1 Bunching 


A separate issue from the number of exceptions is bunching. If daily portfolio changes are 
independent, exceptions should be spread evenly throughout the period used for back- 
testing. In practice, they are often bunched together, suggesting that losses on successive 
days are not independent. One approach to testing for bunching is to use the following 
statistic, suggested by Christoffersen (1998) £ 


=> Inj _ gr) toot tio gor tr] +2 In{(1 _ Rp) oN, (1 _ Ri) a | 


where u; is the number of observations in which we go from a day where we are in state 
i to a day where we are in state j. This statistic is chi-square with one degree of freedom 
if there is no bunching. State 0 is a day where there is no exception while state 1 is a day 
where there is an exception. Also, 


Uo, + U11 
t= --_ 
Uoo F Uo, F H10 + t1 
i Uo1 
OS 
Uoo F uo 
Uy 
Ty = ———— 
11 + 
Uio T Uy 


®See P F Christoffersen, “Evaluating Interval Forecasts,” International Economic Review 39 (1998): 
841-862. 
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Summary 


A value at risk (VaR) calculation is aimed at making a statement of the form: “We are 
X percent certain that we will not lose more than V dollars in time T.” The variable V 
is the VaR, X percent is the confidence level, and T is the time horizon. It has become 
a very popular risk measure. An alternative measure that provides better incentives for 
traders and has rather better theoretical properties is expected shortfall (ES). This is the 
expected loss conditional on the loss being greater than the VaR level. As Chapter 18 
explains, regulators are switching from VaR to ES for market risk measurement. 

When changes in a portfolio value are normally distributed, it is easy to calculate 
VaR and ES from the mean and standard deviation of the change in the portfolio value 
during time T. If one-day changes in the value have zero-mean independent normal 
distributions, a T-day VaR (ES) equals the one-day VaR (ES) multiplied by VT . When 
the independence assumption is relaxed, other somewhat more complicated formulas 
can be used to go from the one-day VaR to the N-day VaR. In practice, losses often have 
heavier tails than the normal distribution. The power law is a way of modeling the tail 
of a distribution from empirical data. The theoretical basis for this approach is extreme 
value theory, which will be discussed in the next chapter. 

Consider the situation where a portfolio has a number of subportfolios. The marginal 
value of a risk measure (VaR or ES) with respect to the ith subportfolio is the partial 
derivative of the risk measure with respect to the size of the subportfolio. The incre- 
mental VaR (ES) with respect to a particular subportfolio is the incremental effect of 
that subportfolio on VaR (ES). There is a formula that can be used for dividing VaR 
(ES) into components that correspond to the positions taken in the subportfolios. The 
component VaRs (ESs) sum to VaR (ES), and each component is, for a large portfolio 
of relatively small positions, approximately equal to the corresponding incremental VaR 
(ES). 

Back-testing is an important activity. It examines how well a particular model for 
calculating a risk measure would have performed in the past. It is relatively easy to carry 
out for VaR. Back-testing may indicate weaknesses in a VaR model if the percentage of 
exceptions (that is, the percentage of times the actual loss exceeds VaR) is much greater 
or much less than that expected. There are statistical tests to determine whether a VaR 
model should be rejected because of the percentage of exceptions. As we will see in 
Chapter 15, regulators have rules for increasing market risk capital if they consider the 
results from back-testing over 250 days to be unsatisfactory. 
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12.2 


12.3 


12.4 


12.5 


12.6 


12.7 


Practice Questions and Problems 
(Answers at End of Book) 


What is the difference between expected shortfall and VaR? What is the theoret- 

ical advantage of expected shortfall over VaR? 

What conditions must be satisfied by the weights assigned to percentiles in a risk 

measure for the subadditivity condition in Section 12.5 to be satisfied? 

A fund manager announces that the fund’s one-month 95% VaR is 6% of the size 

of the portfolio being managed. You have an investment of $100,000 in the fund. 

How do you interpret the portfolio manager’s announcement? 

A fund manager announces that the fund’s one-month 95% expected shortfall is 

6% of the size of the portfolio being managed. You have an investment of $100,000 

in the fund. How do you interpret the portfolio manager’s announcement? 

Suppose that each of two investments has a 0.9% chance of a loss of $10 million 

and a 99.1% chance of a loss of $1 million. The investments are independent of 

each other. 

(a) What is the VaR for one of the investments when the confidence level is 99%? 

(b) What is the expected shortfall for one of the investments when the confidence 
level is 99%? 

(c) What is the VaR for a portfolio consisting of the two investments when the 
confidence level is 99%? 

(d) What is the expected shortfall for a portfolio consisting of the two investments 
when the confidence level is 99%? 

(e) Show that in this example VaR does not satisfy the subadditivity condition, 
whereas expected shortfall does. 

Suppose that the change in the value of a portfolio over a one-day time period is 

normal with a mean of zero and a standard deviation of $2 million; what is (a) the 

one-day 97.5% VaR, (b) the five-day 97.5% VaR, and (c) the five-day 99% VaR? 

What difference does it make to your answer to Problem 12.6 if there is first-order 

daily autocorrelation with a correlation parameter equal to 0.16? 
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12.13 


12.14 


12.15 


12.16 


12.17 


Explain carefully the differences between marginal VaR, incremental VaR, and 
component VaR for a portfolio consisting of a number of assets. 

Suppose that we back-test a VaR model using 1,000 days of data. The VaR con- 
fidence level is 99% and we observe 17 exceptions. Should we reject the model 
at the 5% confidence level? Use a one-tailed test. 

Explain what is meant by bunching. 

Prove equation 12.5. 

The change in the value of a portfolio in one month is normally distributed with 
a mean of zero and a standard deviation of $2 million. Calculate the VaR and ES 
for a confidence level of 98% and a time horizon of three months. 


Further Questions 


Suppose that each of two investments has a 4% chance of a loss of $10 million, 

a 2% chance of a loss of $1 million, and a 94% chance of a profit of $1 million. 

They are independent of each other. 

(a) What is the VaR for one of the investments when the confidence level is 95%? 

(b) What is the expected shortfall when the confidence level is 95%? 

(c) What is the VaR for a portfolio consisting of the two investments when the 
confidence level is 95%? 

(d) What is the expected shortfall for a portfolio consisting of the two investments 
when the confidence level is 95%? 

(e) Show that, in this example, VaR does not satisfy the subadditivity condition, 
whereas expected shortfall does. 

Suppose the first-order autocorrelation for daily changes in the value of a port- 

folio is 0.12. The 10-day VaR, calculated by multiplying the one-day VaR by 

10, is $2 million. What is a better estimate of the VaR that takes account of 

autocorrelation? 

Suppose that we back-test a VaR model using 1,000 days of data. The VaR con- 

fidence level is 99% and we observe 15 exceptions. Should we reject the model 

at the 5% confidence level? Use Kupiec’s two-tailed test. 

The change in the value of a portfolio in three months is normally distributed 

with a mean of $500,000 and a standard deviation of $3 million. Calculate the 

VaR and ES for a confidence level of 99.5% and a time horizon of three months. 

The probability that the loss from a portfolio will be greater than $10 million in 

one month is estimated to be 5%. 

(a) What is the one-month 99% VaR, assuming the change in value of the port- 
folio is normally distributed with zero mean? 

(b) What is the one-month 99% VaR, assuming that the power law applies with 
a = 3° 


Chapter 13 


Historical Simulation and 
Extreme Value Theory 


and expected shortfall (ES) for market risk. It is known as historical simulation. It 

involves using the day-to-day changes in the values of market variables that have 
been observed in the past in a direct way to estimate the probability distribution of the 
change in the value of the current portfolio between today and tomorrow. 


I n this chapter, we cover the most popular approach for calculating value at risk (VaR) 


After describing the mechanics of the historical simulation approach, the chapter 
explains a number of extensions that can improve accuracy. It also covers stressed VaR 
and stressed ES, which are used by regulators to determine capital for market risk. Finally, 
it covers extreme value theory. This is a tool that can be used to improve VaR and ES 
estimates and to increase the confidence level used for them. 

All the models covered in this chapter are illustrated with a portfolio consisting of 
an investment in four different stock indices. Historical data on the indices and VaR 
calculations can be found at www-2.rotman.utoronto.ca/~hull/RMFI/VaRExample. 


13.1 The Methodology 


Historical simulation involves using past data as a guide to what will happen in the future. 
Suppose that we want to calculate VaR for a portfolio using a one-day time horizon, a 
99% confidence level, and 501 days of data. (The time horizon and confidence level are 
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those typically used for a market risk VaR calculation; we use 501 days of data because, 
as we will see, it leads to 500 scenarios being created.) 

The first step is to identify the market variables affecting the portfolio. These market 
variables are sometimes referred to as risk factors. They typically include exchange rates, 
interest rates, stock indices, volatilities, and so on. Data are then collected on movements 
in these market variables over the most recent 501 days. This provides 500 alternative 
scenarios for what can happen between today and tomorrow. Denote the first day for 
which we have data as Day 0, the second day as Day 1, and so on. Scenario 1 is where 
the percentage changes in the values of all variables are the same as they were between 
Day 0 and Day 1, Scenario 2 is where they are the same as between Day 1 and Day 2, 
and so on. For each scenario, the dollar change in the value of the portfolio between 
today and tomorrow is calculated. This defines a probability distribution for daily loss 
(with gains counted as negative losses) in the value of the portfolio. The 99 percentile 
of this distribution can be estimated as the fifth worst outcome.' The estimate of VaR 
is the loss when we are at this 99 percentile point. We are 99% certain that we will not 
take a loss greater than the VaR estimate if the percentage changes in market variables in 
the past 500 days are representative of what will happen between today and tomorrow. 
ES is the average loss conditional that we are in the 1% tail of the loss distribution. VaR 
is estimated as the fifth worst loss. ES can be estimated by averaging the losses that are 
worse than VaR—that is, the four worst losses? 

To express the approach algebraically, define v; as the value of a market variable on 
Day i and suppose that today is Day n. The ith scenario in the historical simulation 
approach assumes that the value of the market variable tomorrow will be 

v. 


Value under ith Scenario = v, — (13.1) 
Vi 


For some variables such as interest rates, credit spreads, and volatilities, actual rather 
than percentage changes in market variables are considered. It is then the case that equa- 
tion (13.1) becomes: 


Value under ith Scenario = v, + v; — vi1 


To simplify the discussion, in the rest of this chapter we will assume that the historical 
simulation is based on percentage changes in the underlying market variables. 


1 This is what is normally done, but there are alternatives. A case can be made for using the fifth worst 
loss, the sixth worst loss, or an average of the two. In Excel’; PERCENTILE function, when there 
are n observations and k is an integer, the k/(n — 1) percentile is the observation ranked k + 1. Other 
percentiles are calculated using linear interpolation. 

2 Footnote 1 pointed out that there are alternative VaR estimates that can be made from discrete data. 
The same is true for ES. We could, for example, average the five worst observations (i.e., include the 
VaR estimate). Alternatively, we could give the fifth worst loss half the weight of the others. What is 
suggested here seems to correspond to market practice. 
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Table 13.1 Investment Portfolio Used for 
VaR Calculations on September 25, 2008 


Index Portfolio Value ($000s) 
DJIA 4,000 
FTSE 100 3,000 
CAC 40 1,000 
Nikkei 225 2,000 
Total 10,000 


Table 13.2 U.S. Dollar Equivalent of Stock Indices for Historical Simulation Calculation 


Day Date DJIA FTSE 100 CAC 40 Nikkei 225 
0 Aug. 7, 2006 11,219.38 11,131.84 6,373.89 131.77 
1 Aug. 8, 2006 11,173.59 11,096.28 6,378.16 134.38 
2 Aug. 9, 2006 11,076.18 11,185.35 6,474.04 135.94 
3 Aug. 10, 2006 11,124.37 11,016.71 6,357.49 135.44 
499 Sept. 24, 2008 10,825.17 9,438.58 6,033.93 114.26 
500 Sept. 25, 2008 11,022.06 9,599.90 6,200.40 112.82 


13.1.1 Illustration 


To illustrate the calculations underlying the approach, suppose that an investor in the 
United States owns, on September 25, 2008, a portfolio worth $10 million consisting 
of investments in four stock indices: the Dow Jones Industrial Average (DJIA) in the 
United States, the FTSE 100 in the United Kingdom, the CAC 40 in France, and the 
Nikkei 225 in Japan. The value of the investment in each index on September 25, 2008, 
is shown in Table 13.1. An Excel spreadsheet containing 501 days of historical data on 
the closing prices of the four indices and a complete set of VaR calculations are on the 
author’s website? 


www-2.rotman.utoronto.ca/~hull/RMFI/VaRExample 


The calculations for this section are in worksheets 1 to 3. 

Because we are considering a U.S. investor, the values of the FTSE 100, CAC 40, 
and Nikkei 225 must be measured in U.S. dollars. For example, the FTSE 100 stood at 
5,823.40 on August 10, 2006, when the exchange rate was 1.8918 USD per GBP. This 
means that, measured in U.S. dollars, it was at 5,823.40 x 1.8918 = 11,016.71. An extract 
from the data with all indices measured in U.S. dollars is shown in Table 13.2. 


3To keep the example as straightforward as possible, only days when all four indices traded were 
included in the compilation of the data. This is why the 501 items of data extend from August 7, 
2006, to September 25, 2008. In practice, an attempt might be made to fill in data for days that were 
not USS. holidays. 
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Table 13.3 Scenarios Generated for September 26, 2008, Using Data in Table 13.2 (all 
indices measured in U.S. dollars) 


Scenario Portfolio Loss 
Number DJIA FTSE 100 CAC 40 Nikkei 225 Value ($000s) ($000s) 
1 10,977.08 9,569.23 6,204.55 115.05 10,014.334 —14.334 
2 10,925.97 9,676.96 6,293.60 114.13 10,027.481 —27.481 
3 11,070.01 9,455.16 6,088.77 112.40 9,946.736 53.264 
499 10,831.43 9,383.49 6,051.94 113.85 9,857.465 142.535 
500 11,222.53 9,763.97 6,371.45 111.40 10,126.439 —126.439 


September 25, 2008, is an interesting date to choose in evaluating an equity invest- 
ment. The turmoil in credit markets, which started in August 2007, was more than a 
year old. Equity prices had been declining for several months. Volatilities were increas- 
ing. Lehman Brothers had filed for bankruptcy 10 days earlier. The Treasury secretary’s 
$700 billion Troubled Asset Relief Program (TARP) had not yet been passed by the 
United States Congress. 

Table 13.3 shows the values of the indices (measured in USS. dollars) on September 
26, 2008, for the scenarios considered. Scenario 1 (the first row in Table 13.3) shows the 
values of indices on September 26, 2008, assuming that their percentage changes between 
September 25 and September 26, 2008, are the same as they were between August 7 and 
August 8, 2006; Scenario 2 (the second row in Table 13.3) shows the values of indices on 
September 26, 2008, assuming these percentage changes are the same as those between 
August 8 and August 9, 2006; and so on. In general, Scenario i assumes that the percentage 
changes in the indices between September 25 and September 26 are the same as they 
were between Day i — 1 and Day i for 1 < i < 500. The 500 rows in Table 13.3 are the 
500 scenarios considered. 

The DJIA is 11,022.06 on September 25, 2008. On August 8, 2006, it was 11,173.59, 
down from 11,219.38 on August 7, 2006. The value of the DJIA under Scenario 1 is 
therefore 


11,173.59 
11,022.06 x 2113? = 10,977.08 
11,219.38 


Similarly, the value of the FTSE 100, the CAC 40, and the Nikkei 225 (measured in U.S. 
dollars) are 9,569.23, 6,204.55, and 115.05, respectively. The value of the portfolio under 
Scenario 1 is therefore (in $000s): 


4,000 x 12:977-08 5 a9 x 9:56.23 4.4 99 x¢ £20455 4 9 oq9 x 115.05 
11,022.06 9,599.90 6,200.40 


= 10,014.334 
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Figure 13.1 Histogram of Losses for the Scenarios Considered between September 25 and 
September 26, 2008 


The portfolio therefore has a gain of $14,334 under Scenario 1. A similar calculation is 
carried out for the other scenarios. A histogram for the losses (gains are negative losses) is 
shown in Figure 13.1. (The bars on the histogram represent losses, in $000s, in the ranges 
450 to 550, 350 to 450, 250 to 350, and so on.) 

The losses for the 500 different scenarios are then ranked. An extract from the results 
of doing this is shown in Table 13.4. The worst scenario is number 494. The one-day 
99% value at risk can be estimated as the fifth worst loss. This is $253,385. 


Table 13.4 Losses Ranked from 
Highest to Lowest for 500 Scenarios 


Scenario Number Loss ($000s) 
494 477.841 
339 345.435 
349 282.204 
329 277.041 
487 253.385 
227 217.974 
131 202.256 
238 201.389 
473 191.269 
306 191.050 
477 185.127 
495 184.450 
376 182.707 
237 180.105 


365 172.224 
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As explained in Section 12.6, the 10-day 99% VaR is often calculated as y 10 times 
the one-day 99% VaR. In this case, the 10-day VaR would therefore be 


V 10 X 253,385 = 801,274 


or $801,274. 

Each day, the VaR estimate in our example would be updated using the most recent 
501 days of data. Consider, for example, what happens on September 26, 2008 (Day 501). 
We find out new values for all the market variables and are able to calculate a new value 
for our portfolio. We then go through the procedure we have outlined to calculate a new 
VaR. Data on the market variables from August 8, 2006, to September 26, 2008 (Day 
1 to Day 501), are used in the calculation. (This gives us the required 500 observations 
on the percentage changes in market variables; the August 7, 2006, Day 0, values of the 
market variables are no longer used.) Similarly, on the next trading day, September 29, 
2008 (Day 502), data from August 9, 2006, to September 29, 2008 (Day 2 to Day 502), 
are used to determine VaR; and so on. 

In practice, a financial institution’s portfolio is, of course, considerably more compli- 
cated than the one we have considered here. It is likely to consist of thousands or tens 
of thousands of positions. Often some of the positions are in forward contracts, options, 
and other derivatives. Also, the portfolio itself is likely to change from day to day. If the 
financial institution’s trading leads to a riskier portfolio, the 10-day 99% VaR typically 
increases; if it leads to a less risky portfolio, the VaR typically decreases. The VaR on any 
given day is calculated on the assumption that the portfolio will remain unchanged over 
the next business day. 

The market variables (or risk factors) that have to be considered in a VaR calculation 
include exchange rates, commodity prices, and interest rates. In the case of interest rates, 
a financial institution typically needs term structures describing zero-coupon LIBOR, 
Treasury, and OIS interest rates in each of a number of different currencies in order to 
value its portfolio. The market variables that are considered can be the swap rates and 
bond yields from which these term structures are calculated (see Appendix B for the 
calculations to obtain the zero-coupon term structure of interest rates). There might be 
as many as 10 market variables for each zero curve to which the financial institution is 
exposed. 


13.1.2 Expected Shortfall 


To calculate expected shortfall using historical simulation, we average the losses that are 
worse than VaR. In the case of our example, the four worst losses ($000s) are from 
scenarios 494, 339, 349, and 329 (see Table 13.4). The average of the losses for these 
scenarios is $345,630. This is the expected shortfall estimate. 
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13.1.3 Stressed VaR and Stressed ES 


The calculations given so far assume that the most recent data are used for the historical 
simulation on any given day. For example, when calculating VaR and ES for the four- 
index example we used data from the immediately preceding 501 days. We will refer 
to VaR and ES calculated in this way as current VaR and current ES. However, historical 
simulations can be based on data from any period in the past. Periods of high volatility 
will tend to give high values for VaR and ES, whereas periods of low volatility will tend 
to give low values. 

Regulators have introduced measures known as stressed VaR and stressed ES. To cal- 
culate the measures, a financial institution must search for the 251-day period that would 
be particularly stressful for its current portfolio. The data for that 251-day period then 
play the same role as the 501-day period in our example. The changes in market variables 
between Day 0 and Day 1 of the 251-day period are used to create the first scenario; the 
changes in market variables between Day 1 and Day 2 of the 251-day period are used 
to create the second scenario; and so on. In total, 250 scenarios are created. The one-day 
99% stressed VaR can be calculated as the loss that is midway between the loss for the 
second worst scenario and the loss for the third worst scenario. The one-day 99% ES can 


be calculated as the average of the worst two losses.* 


13.2 Accuracy of VaR 


The historical simulation approach estimates the distribution of portfolio changes from a 
finite number of observations. As a result, the estimates of percentiles of the distribution 
are subject to error. 

Kendall and Stuart (1972) describe how to calculate a confidence interval for the per- 
centile of a probability distribution when it is estimated from sample data Suppose that 
the q-percentile of the distribution is estimated as x. The standard error of the estimate 
is 


1 /Gd=494 
F(x) n 


where n is the number of observations and f(x) is an estimate of the probability den- 
sity function of the loss evaluated at x. The probability density, f (x), can be estimated 


‘These are not the only approaches. VaR could be calculated as the third worst loss or the second 
worst loss. (Some regulators such as the Federal Reserve in the United States prefer the latter.) The 
ES can be calculated as 0.44 + 0.46 + 0.2c, where 4, ©, and c are the three worst losses with q > 
Co > 63. 

>See M. G. Kendall and A. Stuart, The Advanced Theory of Statistics, vol. 1, Distribution Theory, 4th ed. 
(London: Charles Griffin, 1972). 
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approximately by fitting the empirical data to an appropriate distribution whose prop- 
erties are known. 


Example 13.1 


Suppose we are interested in estimating the 99th percentile of a loss distribution from 
500 observations so that n = 500 and q = 0.99. We can estimate f(x) by approximat- 
ing the actual empirical distribution with a standard distribution whose properties are 
known. Suppose that a normal distribution is chosen as the standard distribution and 
the best-fit mean and standard deviation are zero and $10 million, respectively. Using 
Excel, the 99th percentile is NORMINV(0.99,0,10) or 23.26. The value of f(x) is 
NORMDIST(23.26,0,10,FALSE) or 0.0027. The standard error of the estimate that 


is made is 
1 x 0.01 x 0.99 = 1.67 
0.0027 V 500 


If the estimate of the 99th percentile using historical simulation is $25 million, a 95% 
confidence interval is from 25 — 1.96 X 1.67 to 25 + 1.96 x 1.67, that is, from $21.7 
million to $28.3 million. 


As Example 13.1 illustrates, the standard error of a VaR estimated using historical 
simulation tends to be quite high. It decreases as the VaR confidence level is decreased. 
For example, ifin Example 13.1 the VaR confidence level had been 95% instead of 99%, 
the standard error would be $0.95 million instead of $1.67 million. The standard error 
declines as the sample size is increased—but only as the square root of the sample size. 
If we quadrupled the sample size in Example 13.1 from 500 to 2,000 observations, the 
standard error halves from $1.67 million to about $0.83 million. 

Additionally, we should bear in mind that historical simulation assumes that the 
joint distribution of daily changes in market variables is stationary through time. This 
is unlikely to be exactly true and creates additional uncertainty about VaR. 

In the case of the data considered in Tables 13.1 to 13.4, when the loss is measured 
in $000s, the mean is 0.870 and the standard deviation is 93.698. If a normal distribution 
is assumed, a similar calculation to that in Example 13.1 gives f(x) as 0.000284 and the 
standard error of the estimate (in $000s) is 


1 0.01 x 0.99 


— = 15.643 
0.000284 500 


The estimate of VaR is $253,385. This shows that a 95% confidence interval for the VaR 
is about $220,000 to $280,000. 

The normal distribution is not a particularly good assumption for the loss distribu- 
tion, because losses have heavier tails than the normal distribution. (Excess kurtosis, a 
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measure of heaviness of tails, is 4.2 for the data in Tables 13.1 to 13.4.) Better standard 
error estimates can be obtained by assuming a Pareto distribution for f(x), as discussed 
in Section 13.5. 


13.3 Extensions 


When current VaR and current ES are calculated (using data from an immediately pre- 
ceding period), the assumption is that recent history is in some sense a good guide to 
the future. More precisely, it is that the empirical probability distribution estimated for 
market variables over the immediately preceding period is a good guide to the behavior 
of the market variables over the next day. Unfortunately, the behavior of market variables 
is nonstationary. Sometimes the volatility of a market variable is high; sometimes it is low. 
In this section, we cover extensions of the basic historical simulation approach in Sec- 
tion 13.1 that are designed to adjust for nonstationarity. We also show how an approach 
known as the bootstrap method can be used to determine standard errors. 


13.3.1 Weighting of Observations 


The basic historical simulation approach assumes that each day in the past is given equal 
weight. More formally, if we have observations for n day-to-day changes, each of them 
is given a weighting of 1/n. Boudoukh, Richardson, and Whitelaw (1998) suggest that 
more recent observations should be given more weight because they are more reflective 
of current volatilities and current macroeconomic conditions The natural weighting 
scheme to use is one where weights decline exponentially. (We used this when developing 
the exponentially weighted moving average model for monitoring volatility in Chapter 
10.) The weight assigned to Scenario 1 (which is the one calculated from the most 
distant data) is A times that assigned to Scenario 2. This in turn is À times that given to 
Scenario 3, and so on. So that the weights add up to 1, the weight given to Scenario 
i is 


N — A) 
12y 


where nis the number of scenarios. As A approaches 1, this approaches the basic historical 
simulation approach where all observations are given a weight of 1/n. (See Problem 13.2.) 

VaR is calculated by ranking the observations from the worst outcome to the best. 
Starting at the worst outcome, weights are summed until the required percentile of the 
distribution is reached. For example, if we are calculating VaR with a 99% confidence 
level, we continue summing weights until the sum just exceeds 0.01. We have then 


é See J. Boudoukh, M. Richardson, and R. Whitelaw, “The Best of Both Worlds: A Hybrid Approach 
to Calculating Value at Risk,” Risk 11 (May 1998): 64-67. 
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Table 13.5 Losses Ranked from Highest to Lowest for 500 Scenarios, 
with Weights 


Scenario Number Loss ($000s) Weight Cumulative Weight 


494 477.841 0.00528 0.00528 
339 345.435 0.00243 0.00771 
349 282.204 0.00255 0.01027 
329 277.041 0.00231 0.01258 
487 253.385 0.00510 0.01768 
227 217.974 0.00139 0.01906 
131 202.256 0.00086 0.01992 
238 201.389 0.00146 0.02138 
473 191.269 0.00476 0.02614 


reached the 99% VaR level. The parameter À can be chosen by trying different values 
and seeing which one back-tests best. One disadvantage of the exponential weighting 
approach relative to the basic historical simulation approach is that the effective sample 
size 1s reduced. However, we can compensate for this by using a larger value of n. Indeed, 
it is not really necessary to discard old days as we move forward in time, because they are 
given relatively little weight. 

Table 13.5 shows the results of using this procedure for the portfolio considered in 
Section 13.1 with A = 0.995. (See worksheets 4 and 5 on the author’s website.) The value 
of VaR when the confidence level is 99% is now the third worst loss, $282,204 (not the 
fifth worst loss of $253,385). The reason for this result is that recent observations are 
given more weight and the largest losses have occurred relatively recently. The standard 
calculation in Section 13.1 gives all observations a weighting of 1/500 = 0.002. The 
highest loss occurred on Scenario 494, and this scenario has a weight of 


(0.995°) x 0.005 


1 — 0.995500 = 0.00528 


The 0.01 tail of the loss distribution consists of a probability 0.00528 of a loss of $477,841, 
a 0.00243 probability of a loss of $345,435, and a 0.01 — 0.00528 — 0.00243 = 0.00228 
probability of a loss of $282,204. The expected shortfall can therefore be calculated as’ 


0.00528 x 477,841 + 0.00243 X 345,435 + 0.00228 x 282,204 


= 400,914 
0.01 


7 This calculation seems to fit in with the nature of weighted observations approach best. An alternative 
would be to calculate a weighted average of the two worst losses. 
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13.3.2 Volatilty Scaling for Market Variables 


Hull and White (1998) suggest a way of incorporating estimates of volatility into the 
historical simulation approach Define the daily volatility for a particular market variable 
estimated at the end of day i — 1 as 6,. This can be considered to be an estimate of the 
daily volatility between the end of day i — 1 and the end of day i. Suppose that it is now 
day n so that the current estimate of the volatility of the market variable (i.e., the volatility 
between today and tomorrow) is 6,,,;- 

Suppose that 6,,, is twice 6; for a particular market variable. This means that we 
estimate the daily volatility of the market variable to be twice as great today as on day 
i—1.The changes we expect to see between today and tomorrow are twice as big as 
changes between day i — 1 and day i. When carrying out the historical simulation and 
creating a sample of what could happen between today and tomorrow based on what 
happened between day i — 1 and day i, it therefore makes sense to multiply the latter by 
2. In general, when this approach is used, the expression in equation (13.1) for the value 
of a market variable under the ith scenario becomes 

Mint + (Vi — M1) Ont [9 


Value under ith Scenario = v, ——————_ (13.2) 
Vi—1 


Each market variable can be handled in the same way. 

This approach takes account of volatility changes in a natural and intuitive way and 
produces VaR estimates that incorporate more current information. The VaR estimates 
can be greater than any of the historical losses that would have occurred for the current 
portfolio during the historical period considered. Hull and White produce evidence 
using exchange rates and stock indices to show that this approach is superior to traditional 
historical simulation and to the exponential weighting scheme described earlier. 

For the data in Table 13.2, the daily volatility estimates, calculated using the exponen- 
tially weighted moving average (EWMA) method with the À parameter equal to 0.94, are 
shown in Table 13.6? (See worksheets 6 to 8 on the author’s website.) The ratios of the 
volatility estimated for September 26, 2008 (last row of table), to the volatility estimated 
for August 8, 2008 (first row of table), are 1.98, 2.26, 2.21, and 1.15 for the DJIA, FTSE 
100, CAC 40, and Nikkei 225, respectively. These are used as multipliers for the actual 
changes in the indices between August 7 and August 8, 2006. Similarly, the ratios of the 
volatility estimated for September 26, 2008 (last row of table), to the volatility estimated 
for August 9, 2008 (second row of table), are 2.03, 2.33, 2.28, and 1.12 for the DJIA, 
FTSE 100, CAC 40, and Nikkei 225, respectively. These are used as multipliers for the 
actual changes in the indices between August 8 and August 9, 2006. Multipliers for the 
other 498 daily changes are calculated in the same way. 


8 See J. Hull and A. White, “Incorporating Volatility Updating into the Historical Simulation Method 
for Value at Risk,’ Journal of Risk (Fall 1998): 5-19. 

? A decision must be made on how to start the variance time series. The initial variance in the calcu- 
lations reported here was the sample variance calculated over the whole time period. 
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Table 13.6 Volatilities (% per Day) Estimated for the Following Day Using EWMA 


Day Date DJIA FTSE 100 CAC 40 Nikkei 225 

0 Aug. 7, 2006 1.11 1.42 1.40 1.38 

1 Aug. 8, 2006 1.08 1.38 1.36 1.43 

2 Aug. 9, 2006 1.07 1.35 1.36 1.41 

3 Aug. 10, 2006 1.04 1.36 1.39 1.37 
499 Sept. 24, 2008 2.21 3.28 3.11 1.61 
500 Sept. 25, 2008 2.19 3.21 3.09 1.59 


Because volatilities were highest at the end of the historical period in our example, 
the effect of the volatility adjustments is to create more variability in the gains and losses 
for the 500 scenarios. Table 13.7 shows an extract from a table that ranks losses from the 
highest to the lowest. Comparing this with Table 13.4, we see that the losses are much 
higher. The one-day 99% VaR is $602,968. The one-day ES is $786,855. These are more 
than twice as high as the estimates given by standard calculations. 

In this particular case, the volatility of the stock indices remained high for the rest of 
2008, with daily changes of between 5% and 10% in the indices being not uncommon. 
Estimating VaR and ES using the volatility-adjusted approach would have worked better 
than using the standard approach. 


13.3.3 Volatility Scaling for the Portfolio 


A variation on the approach we just have described is to use EWMA to monitor the 
standard deviation of the simulated losses given by successive scenarios in the standard 
approach in Section 13.1. The losses are those given in the final column of Table 13.3. 


Table 13.7 Losses Ranked from Highest to 
Lowest for 500 Scenarios When Market 
Variable Movements Are Adjusted for Volatility 


Scenario Number Loss ($000s) 
131 1,082.969 
494 715.512 
227 687.720 
98 661.221 
329 602.968 
339 546.540 
74 492.764 
193 470.092 


487 458.177 
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Table 13.8 Results When Volatility of Simulated Losses Is Monitored 


Scenario Loss Given by Loss SD SD Adjusted 
Number Standard Approach ($000s) Ratio Loss 

1 —14.334 93.698 2.203 —31.571 

—27.481 90.912 2.270 —62.385 

3 53.264 88.399 2.335 124.352 

499 142.535 209.795 0.984 140.214 

500 —126.439 206.378 1.000 —126.439 


(See worksheets 9 and 10 on the author’s website for the calculations.) An adjusted 
loss for the ith scenario is then calculated by multiplying the loss given by the standard 
approach by the ratio of the estimated standard deviation for the last (500th) scenario 
to the estimated standard deviation for the ith scenario. This procedure is much simpler 
than incorporating volatility on a variable-by-variable basis and has the advantage that 
changing correlations as well as changing volatilities are implicitly considered. Table 13.8 
shows the calculation of the portfolio loss standard deviation and the adjusted losses for 
our example. (Similarly to before, the Day 1 loss variance is the sample variance calculated 
over the whole 500-day period, and the À parameter used in EWMA calculations is 0.94.) 
It can be seen that the estimated standard deviation of the losses for later scenarios is much 
larger than that for early scenarios. The losses for scenarios 1, 2,3, ... are multiplied by 
2.203, 2.270, 2.335, ... 

Table 13.9 shows the ranked adjusted losses. The results are similar to those in 
Table 13.7 (but much easier to produce). The one-day 99% VaR is $627,916, and the 
one-day expected shortfall is $777,545. 


Table 13.9 Adjusted Losses Ranked from Highest 
to Lowest for 500 Scenarios When Simulated Losses 
Are Adjusted for Their Volatility 


Scenario Number Adjusted Loss ($000s) 
131 891.403 
494 763.818 
227 757.355 
339 697.604 
98 627.916 
329 609.815 
283 523.259 
487 512.525 


441 456.700 
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13.3.4 Bootstrap Method 


The bootstrap method is a variation on the basic historical simulation approach, aimed at 
calculating a confidence interval for VaR (current or stressed).!° It involves creating a set 
of changes in the portfolio value based on historical movements in market variables in 
the usual way. We then sample with replacement from these changes to create many new 
similar data sets. We calculate the VaR for each of the new data sets. Our 95% confidence 
interval for VaR is the range between the 2.5 percentile point and the 97.5 percentile 
point of the distribution of the VaRs calculated from the data sets. 

Suppose, for example, that we have 500 days of data. We could sample with replace- 
ment 500,000 times from the data to obtain 1,000 different sets of 500 days of data. We 
calculate the VaR for each set. We then rank the VaRs. Suppose that the 25th largest 
VaR is $5.3 million and the 975th largest VaR is $8.9 million. The 95% confidence 
interval for VaR is $5.3 million to $8.9 million. Usually, the width of the confidence 
interval calculated for VaR using the bootstrap method is less than that calculated using 
the procedure in Section 13.2. 


13.4 Computational Issues 


Historical simulation involves valuing the whole portfolio of a financial institution many 
times (500 times in our example). This can be computationally very time consum- 
ing. This is particularly true when some of the instruments in the portfolio are val- 
ued with Monte Carlo simulation, because there is then a simulation within a simu- 
lation problem because each trial of the historical simulation involves a Monte Carlo 
simulation. 

To reduce computation time, financial institutions sometimes use a delta-gamma 
approximation. This is explained in Chapter 8. Consider an instrument whose price, P, 
is dependent on a single market variable, S. An approximate estimate of the change, AP, 
in P resulting from a change, AS, in S is 


AP =6AS+ IAS? (13.3) 


where 6 and y are the delta and gamma of P with respect to S. The Greek letters 6 and y 
are always known because they are calculated when the instrument is marked to market 
each day. This equation can therefore be used as a fast approximate way of calculating 
the changes in the value of the transaction for the changes in the value of S that are 
considered by the historical simulation. 


10 See P Christoffersen and S. Gonçalves, “Estimation Risk in Financial Risk Management,” Journal of 
Risk 7, no. 3 (2007): 1-28. 
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When an instrument depends on several market variables, S,(1 < i < n), equa- 
tion (13.3) becomes 


AP = y 8, AS, + 5 5 Ly ASAS, (13.4) 
i=1 


i=1 j=1 


where 6, and y; are defined as 


13.5 Extreme Value Theory 


Section 10.4 introduced the power law and explained that it can be used to estimate 
the tails of a wide range of distributions. We now provide the theoretical underpinnings 
for the power law and present estimation procedures that are more sophisticated than 
those used in Section 10.4. Extreme value theory (EVT) is the term used to describe the 
science of estimating the tails of a distribution. EVT can be used to improve VaR or 
ES estimates and to help in situations where analysts want to estimate VaR with a very 
high confidence level. It is a way of smoothing and extrapolating the tails of an empirical 
distribution. 


13.5.1 The Key Result 


The key result in EVT was proved by Gnedenko (1943).!' It shows that the tails of a 
wide range of different probability distributions share common properties. 

Suppose that F(v) is the cumulative distribution function for a variable v (such as the 
loss on a portfolio over a certain period of time) and that u is a value of v in the right- 
hand tail of the distribution. The probability that v lies between u and u + y (y > 0) is 
F(u + y) — F(u). The probability that v is greater than u is 1 — F(u). Define F,,(y) as the 
probability that v lies between u and u + y conditional on v > u. This is 


F(u + y) — F(u) 
F.) = an 

1 — F(u) 

The variable F,,(y) defines the right tail of the probability distribution. It is the cumu- 
lative probability distribution for the amount by which v exceeds u given that it does 
exceed u. 


'lSee D. V. Gnedenko, “Sur la distribution limité du terme d’une série aléatoire,” Annals of Mathematics 
44 (1943): 423—453. 
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Gnedenko’s result states that, for a wide class of distributions F(v), the distribution of 
F (y) converges to a generalized Pareto distribution as the threshold u is increased. The 
generalized Pareto (cumulative) distribution is 


-1/§ 


y 
EEE 


The distribution has two parameters that have to be estimated from the data. These 
are € and B. The parameter € is the shape parameter and determines the heaviness of the 
tail of the distribution. The parameter B is a scale parameter. 

When the underlying variable v has a normal distribution, & = 0.1? As the tails of the 
distribution become heavier, the value of € increases. For most financial data, € is positive 
and in the range 0.1 to 0.4.1 


13.5.2 Estimating € and B 


The parameters € and B can be estimated using maximum likelihood methods (see Sec- 
tion 10.9 for a discussion of maximum likelihood methods). The probability density 
function, gz p(y), of the cumulative distribution in equation (13.5) is calculated by differ- 
entiating Gg p(y) with respect to y. It is 


-1/e-1 
Le pO) = (: + 2r) (13.6) 


We first choose a value for u. (A value close to the 95th percentile point of the 
empirical distribution usually works well.) We then rank the observations on v from 
the highest to the lowest and focus our attention on those observations for which v > u. 
Suppose there are n,, such observations and they are v; (1 < i < n,). From equation (13.6), 
the likelihood function (assuming that & # 0) is 


When € = 0, the generalized Pareto distribution becomes 


Ge p(y) = 1 — exp (-2) 


13One of the properties of the distribution in equation (13.5) is that the kth moment of v, E(v*), is 
infinite for k > 1/&. For a normal distribution, all moments are finite. When & = 0.25, only the first 
three moments are finite; when € = 0.5, only the first moment is finite; and so on. 
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Maximizing this function is the same as maximizing its logarithm: 


ny, = -1/E-1 
Şin i (: S c 2) | (13.7) 
i=1 


Standard numerical procedures can be used to find the values of é and B that maximize 


this expression. Excel’s Solver produces good results. 


13.5.3 Estimating the Tail of the Distribution 


The probability that v > u + y conditional that v > u is 1 — G g(y). The probability that 
v> uis 1 — F(u). The unconditional probability that v > x (when x > u) is therefore 


[1 — F] — Ge pe — u)] 


If n is the total number of observations, an estimate of 1 — F(u), calculated from the 
empirical data, is n/n. The unconditional probability that v > x is therefore 


n n 
Prob(v > x) = = [1 — Ge g(x — u)] = — 
n n 


1+6 


el: 
a | (13.8) 


p 


13.5.4 Equivalence to the Power Law 


If we set u = B/E, equation (13.8) reduces to 


ny [x] 
Prob(v > x) = ra = 


This is 


where 


and a = 1/&. This shows that equation (13.8) is consistent with the power law introduced 
in Section 10.4. 
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13.5.5 The Left Tail 


The analysis so far has assumed that we are interested in the right tail of the probability 
distribution of a variable v. If we are interested in the left tail of the probability distribu- 
tion, we can work with —v instead of v. Suppose, for example, that an oil company has 
collected data on daily percentage increases in the price of oil and wants to estimate a 
VaR that is the one-day percentage decline in the price of oil that has a 99.9% probability 
of not being exceeded. This is a statistic calculated from the left tail of the probability 
distribution of oil price increases. The oil company would change the sign of each data 
item (so that the data were measuring oil price decreases rather than increases) and then 
use the methodology that has been presented. 


13.5.6 Calculation of VaR and ES 


To calculate VaR with a confidence level of q, it is necessary to solve the equation 


F(VaR) = q 


Because F(x) = 1 — Prob(v > x), equation (13.8) gives 


n 
q=1-— 
n 


-1/6 
i+ p ver — "| 


P 


so that 


weal) E À 
aR = u+ E a —q}) -1 (13.9) 


The expected shortfall is given by 


_ VaR +B - Eu 
aa ae (13.10) 


13.6 Applications of EVT 


Consider again the data in Tables 13.1 to 13.4. When u = 160, n, = 22 (that is, there 
are 22 scenarios where the loss in $000s 1s greater than 160). Table 13.10 shows calcula- 
tions for the trial values B = 40 and € = 0.3. The value of the log-likelihood function in 
equation (13.7) is —108.37. 

When Excel’s Solver is used to search for the values of B and & that maximize the 
log-likelihood function (see worksheet 11 on the author’s website), it gives 


B = 32.532 
E = 0.436 


and the maximum value of the log-likelihood function is —108.21. 
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Table 13.10 Extreme Value Theory Calculations for Table 13.4 (the parameter 
u is 160 and trial values for B and € are 40 and 0.3, respectively) 


1 &(v; — u) aise 

Scenario Number Loss ($000s) Rank p ( B ) 
494 477.841 1 —8.97 
339 345.435 2 —7.47 
349 282.204 3 —6.51 
329 277.041 4 —6.42 
487 253.385 5 —5.99 
227 217.974 6 —5.25 
131 202.256 7 —4.88 
238 201.389 8 —4.86 
304 160.778 22 —3.71 

—108.37 
Trial Estimates of EVT Parameters 
E B 
0.3 40 


Suppose that we wish to estimate the probability that the portfolio loss between 
September 25 and September 26, 2008, will be more than $300,000 (or 3% of its value). 
From equation (13.8) this is 


_ -1/0.436 
22 lipo ee = 0.0039 
500 32.532 


This is more accurate than counting observations. The probability that the portfolio loss 
will be more than $500,000 (or 5% of its value) is similarly 0.00086. 
From equation (13.9), the value of VaR with a 99% confidence limit is 


—0.436 
ki 22322 { [sea _ 0.99)] = i} = 227.8 


0.436 22 


or $227,800. (In this instance, the VaR estimate is about $25,000 less than the fifth worst 
loss.) When the confidence level is increased to 99.9%, VaR becomes 


—0.436 
iga oo ee { [2u z 0.999)| = i} = 474.0 


0.436 22 


or $474,000. When it is increased further to 99.97%, VaR. becomes 


—0.436 
ioe Á [Ra -— 0.9997)| —1 > =7425 
0.436 | 122 


or $742,500. 
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The formula in equation (13.10) can improve ES estimates and allow the confidence 
level used for ES estimates to be increased. In our example, when the confidence level is 
99%, the estimated ES is 


227.8 + 32.532 — 0.436 X 160 
1 — 0.436 


= 337.9 


or $337,900. When the confidence level is 99.9%, the estimated ES is 


474.0 + 32.532 — 0.436 x 160 
1 — 0.436 


= 774.8 


or $774,800. 

EVT can be used for current or stressed measures. It can also be used in a straight- 
forward way in conjunction with the volatility-scaling procedures in Section 13.3 (see 
Problem 13.11). It can also be used in conjunction with the weighting-of-observations 
procedure in Section 13.3. In this case, the terms being summed in equation (13.7) must 
be multiplied by the weights applicable to the underlying observations. 

A final calculation can be used to refine the confidence interval for the 99% VaR 
estimate in Section 13.2. The probability density function evaluated at the VaR level for 
the probability distribution of the loss, conditional on it being greater than 160, is given 
by the g; p function in equation (13.6). It is 


1 0.436 X (227.8 — 160) —1/0.436-1 
t+ 35532 = 0.0037 
32.532 32.532 


The unconditional probability density function evaluated at the VaR level is n,/n= 
22/500 times this or 0.00016. Not surprisingly, this is lower than the 0.000284 estimated 
in Section 13.2 and leads to a wider confidence interval for VaR. 


13.6.1 Choice of u 


A natural question is: “How do the results depend on the choice of u?” It is often 
found that values of € and B do depend on u, but the estimates of F(x) remain roughly 
the same. (Problem 13.10 considers what happens when u is changed from 160 to 
150 in the example we have been considering.) We want u to be sufficiently high 
that we are truly investigating the shape of the tail of the distribution, but sufficiently 
low that the number of data items included in the maximum likelihood calculation is 
not too low. More data lead to more accuracy in the assessment of the shape of the 
tail. We have applied the procedure with 500 data items. Ideally, more data would be 
used. 

A rule of thumb is that u should be approximately equal to the 95th percentile of 
the empirical distribution. (In the case of the data we have been looking at, the 95th 
percentile of the empirical distribution is 156.5.) In the search for the optimal values 
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of € and ß, both variables should be constrained to be positive. If the optimizer tries 
to set E negative, it is likely to be a sign that either (a) the tail of the distribution is 
not heavier than the normal distribution or (b) an inappropriate value of u has been 
chosen. 


Summary 


Historical simulation is a very popular approach for estimating VaR or ES. It involves 
creating a database consisting of the daily movements in all market variables over a 
period of time. The first simulation trial assumes that the percentage change in each 
market variable is the same as that on the first day covered by the database, the second 
simulation trial assumes that the percentage changes are the same as those on the sec- 
ond day, and so on. The change in the portfolio value is calculated for each simulation 
trial, and VaR is calculated as the appropriate percentile of the probability distribution 
of this change, while ES is calculated as the average change in value in the tail of this 
distribution. 

The standard error for a VaR that is estimated using historical simulation tends to 
be quite high. The higher the VaR confidence level required, the higher the standard 
error. In one extension of the basic historical simulation approach, the weights given 
to observations decrease exponentially as the observations become older; in another, 
adjustments are made to historical data to reflect changes in volatility. 

Extreme value theory is a way of smoothing the tails of the probability distribution 
of portfolio daily changes calculated using historical simulation. It avoids the calculation 
uncertainties mentioned in footnotes 1, 2, 4, and 7. It leads to estimates of VaR and ES 
that reflect the whole shape of the tail of the distribution, not just the positions of a few 
losses in the tails. Extreme value theory can also be used to estimate VaR and ES when 
the confidence level is very high. For example, even if we have only 500 days of data, 
it could be used to come up with an estimate of VaR or ES for a confidence level of 
99.9%. 
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Practice Questions and Problems 
(Answers at End of Book) 


What assumption is being made when VaR is calculated using the historical sim- 
ulation approach and 500 days of data? 

Show that when à approaches 1 the weighting scheme in Section 13.3.1 
approaches the basic historical simulation approach. 

Suppose we estimate the one-day 95% VaR from 1,000 observations (in millions 
of dollars) as 5. By fitting a standard distribution to the observations, the probability 
density function of the loss distribution at the 95% point is estimated to be 0.01. 
What is the standard error of the VaR estimate? 

The one-day 99% VaR for the four-index example is calculated in Section 13.1 
as $253,385. Look at the underlying spreadsheets on the author’s website and 
calculate (a) the 95% one-day VaR, (b) the 95% one-day ES, (c) the 97% one-day 
VaR, and (d) the 97% one-day ES. 

Use the spreadsheets on the author’s website to calculate a one-day 99% VaR and 
the one-day 99% ES using the basic methodology in Section 13.1, if the portfolio 
in Section 13.1 is equally divided between the four indices. 

The weighting-of-observations procedure in Section 13.3.1 gives the one-day 
99% VaR equal to $282,204 and the one-day ES as $400,914. Use the spreadsheets 
on the author’s website to calculate VaR and ES when the A parameter in this 
procedure is changed from 0.995 to 0.99. 

The simplified volatility-scaling for portfolio procedure in Section 13.3.3 gives 
the one-day 99% VaR equal to $627,916 and the one-day ES as $777,545. Use 
the spreadsheets on the author’s website to calculate VaR and ES when the A 
parameter in this procedure is changed from 0.94 to 0.96. 

In the application of extreme value theory (EVT) in Section 13.6, what is the 
probability that the loss will exceed $400,000? 

In the application of EVT in Section 13.6, what is the one-day VaR with a con- 
fidence level of 97%? 

Change u from 160 to 150 in the application of EVT in Section 13.6. How does 
this change the maximum likelihood estimates of é and B? How does it change 
the one-day 99% VaR and the one-day 99% ES when the confidence limit is (a) 
99% and (b) 99.9%? 

Carry out an extreme value theory analysis on the data from the volatility-scaling 
procedure in Table 13.7 and on the author’s website. Use u = 400. What are the 
best fit values of € and P? Calculate the one-day VaR and a one-day ES with a 
99% and 99.9% confidence level. What is the probability of a loss greater than 
$600,000? 


Further Questions 


Suppose that a one-day 97.5% VaR is estimated as $13 million from 2,000 obser- 
vations. The one-day changes are approximately normal with mean zero and 
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13.13 


13.14 


13.15 


13.16 


13.17 


standard deviation $6 million. Estimate a 99% confidence interval for the VaR 

estimate. 

Suppose that the portfolio considered in Section 13.1 has (in $000s) 3,000 in DJIA, 

3,000 in FTSE, 1,000 in CAC 40, and 3,000 in Nikkei 225. Use the spreadsheet 

on the author’s website to calculate what difference this makes to: 

(a) The one-day 99% VaR and ES that are calculated in Section 13.1 

(b) The one-day 99% VaR and ES that are calculated using the weighting-of- 
observations procedure in Section 13.3.1 with A = 0.995. 

(c) The one-day 99% VaR and ES that are calculated using the two volatility- 
scaling procedures in Sections 13.3.2 and 13.3.3 with A = 0.94 (assume that 
the initial variance when EWMaA is applied is the sample variance). 

(d) The one-day 99% VaR and ES that are calculated using extreme value theory 
and equal weightings in Section 13.6. 

Investigate the effect of applying extreme value theory to the volatility-adjusted 

results in Section 13.3.2 with u = 350. 

The weighting-of-observations procedure in Section 13.3.1 gives the one-day 

99% VaR equal to $282,204 and the one-day 99% ES as $400,914. Use the spread- 

sheets on the author’s website to calculate these measures when the A parameter 

in this procedure is changed from 0.995 to 0.99. 

The volatility-scaling procedure in Section 13.3.2 gives the one-day 99% VaR 

equal to $602,968 and the 99% ES as $786,855. Use the spreadsheets on the 

author’s website to calculate VaR and ES when the A parameter in this procedure 

is changed from 0.94 to 0.92. 

Values for the NASDAQ Composite index during the 1,500 days preceding 

March 10, 2006, can be downloaded from the author’s website. Calculate the 

one-day 99% VaR and one-day 99% ES on March 10, 2006, for a $10 million 

portfolio invested in the index using: 

(a) The basic historical simulation approach 

(b) The exponential weighting scheme in Section 13.3.1 with A = 0.995 

(c) The volatility-scaling procedures in Sections 13.3.2 and 13.3.3 with A = 0.94 
(assume that the initial variance when EWMaA is applied is the sample vari- 
ance). 

(d) Extreme value theory with u = 300 and equal weightings. 

(e) A model where daily returns are assumed to be normally distributed with 
mean zero (use both an equally weighted approach and the EWMA approach 
with à = 0.94 to estimate the standard deviation of daily returns). 

Discuss the reasons for the differences between the results you get. 


Chapter 14 


Model-Building Approach 


n alternative to the historical simulation approach for calculating risk mea- 
Ae such as value at risk (VaR) and expected shortfall (ES) is the model- 

building approach, sometimes also referred to as the variance—covariance approach. 
This involves assuming a model for the joint distribution of changes in market variables 
and using historical data to estimate the model parameters. 

The model-building approach is ideally suited to the situation where the change in 
value of a portfolio is linearly dependent on changes in the values of the underlying 
market variables (stock prices, exchange rates, interest rates, etc.). In this case, if the daily 
changes in the underlying market variables are assumed to be multivariate normal, it is 
computationally much faster than historical simulation. It is in essence an extension of 
Harry Markowitz’s pioneering work in portfolio theory (see Section 1.1). The probabil- 
ity distribution of the change in the value of the portfolio is normal, and the mean and 
standard deviation of the change in the value of the portfolio can be calculated analyt- 
ically from the mean and standard deviation of changes in the market variables and the 
correlations between those changes. 

When a portfolio includes options and other non-linear derivatives, the probability 
distribution of the change in the value of a portfolio over a short period of time can no 
longer reasonably be assumed to be normal. The gamma exposures of the portfolio cause 
the probability distribution of change in its value to exhibit skewness. One approach to 
estimating VaR or ES in this situation is Monte Carlo simulation. Unfortunately, the 
model-building approach is then as computationally time consuming as the historical 
simulation approach in Chapter 13. 
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This chapter provides the background for an understanding of two important models 
that are explained later in this book. The first is the Standard Initial Margin Model 
(SIMM), which is used to determine initial margin requirements for over-the-counter 
derivatives that are cleared bilaterally (see Chapter 17). The second is the model used 
in the standardized approach for determining capital in the Fundamental Review of the 
Trading Book (see Chapter 18). 


14.1 The Basic Methodology 


We start by considering how VaR is calculated using the model-building approach in a 
very simple situation where the portfolio under consideration consists of a position in a 
single stock. The portfolio we consider is one consisting of shares in Microsoft valued at 
$10 million. We suppose that the time horizon is 10 days and the VaR confidence level 
is 99%, so we are interested in the loss level over 10 days that we are 99% confident will 
not be exceeded. Initially, we consider a one-day time horizon. 

We assume that the volatility of Microsoft is 2% per day (corresponding to about 
32% per year).' Because the size of the position is $10 million, the standard deviation of 
daily changes in the value of the position is 2% of $10 million, or $200,000. 

It is customary in the model-building approach to assume that the expected change 
in a market variable over the time period considered is zero. This is not exactly true, but 
it is a reasonable assumption. The expected change in the price of a market variable over 
a short time period is generally small when compared to the standard deviation of the 
change. Suppose, for example, that Microsoft has an expected return as high as 20% per 
annum. Over a one-day period, the expected return is 0.20/252, or about 0.08%, whereas 
the standard deviation of the return is 2%. Over a 10-day period, the expected return 


is about 0.08% x 10, or 0.8%, whereas the standard deviation of the return is 21/10, or 
about 6.3%. 

So far, we have established that the change in the value of the portfolio of Microsoft 
shares over a one-day period has a standard deviation of $200,000 and (at least approx- 
imately) a mean of zero. We assume that the change is normally distributed? Because 
N(—2.326) = 0.01, there is a 1% probability that a normally distributed variable will 
decrease in value by more than 2.326 standard deviations. Equivalently, we are 99% cer- 
tain that a normally distributed variable will not decrease in value by more than 2.326 


1 As discussed in Section 10.1, in risk management calculations volatility is usually measured per day 
whereas in option pricing it is measured per year. A volatility per day can be converted to a volatil- 
ity per year by multiplying by 1/252, or about 16. 

? We could assume that the price of Microsoft is lognormal tomorrow. Because one day is such a short 
period of time, this is almost indistinguishable from the assumption we do make that the change in 
the stock price between today and tomorrow is normal. 
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standard deviations. The one-day 99% VaR for our portfolio consisting of a $10 million 
position in Microsoft is therefore 


2.326 X 200,000 = $465,300 


Assuming that the changes in Microsoft’s stock price returns on successive days are 
independent, we can assume that the standard deviation of the return over 10 days is 


0.02 x 10 and that the 10-day return is normally distributed. This gives the 10-day 
99% VaR for Microsoft as 


1,000,000 x 0.02 x y 10 x 2.326 = $1,471,300 


The 10-day 99% ES is given by equation (12.2) with o = 200,000+/10 = 632,500, Y = 
2.326, and X = 0.99. It is $1,687,000. 

Consider next a portfolio consisting of a $5 million position in AT&T, and suppose 
the daily volatility of AT&T is 1% (approximately 16% per year). A similar calculation 
to that for Microsoft shows that the standard deviation of the change in the value of the 
portfolio in one day is 


5,000,000 x 0.01 = 50,000 
Assuming that the change is normally distributed, the one-day 99% VaR is 
50,000 X 2.326 = $116,300 


and the 10-day 99% VaR is 


116,300 x y 10 = $367,800 


The 10-day 99% ES is, from equation (12.2), $421,400. 


14.1.1 Two-Asset Case 


Now consider a portfolio consisting of both $10 million of Microsoft shares and 
$5 million of AT&T shares. We suppose that the returns on the two shares have a cor- 
relation of 0.3. A standard result in statistics tells us that, if two variables X and Y have 
standard deviations equal to 6, and oy with the coefficient of correlation between them 
being equal to p, then the standard deviation of X + Y is given by 


Oxy = 1/02 + 5%, + 2poxoy 
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To apply this result, we set X equal to the change in the value of the position in Microsoft 
over a one-day period and Y equal to the change in the value of the position in AT&T 
over a one-day period, so that 


6x = 200,000 6 = 50,000 


The standard deviation of the one-day change in the value of the portfolio we are con- 
sidering is therefore 


200,0002 + 50,0007 + 2 x 0.3 x 200,000 x 50,000 = 220,227 


The mean change is assumed to be zero. If we further assume that the joint distribution 
of the returns from Microsoft and AT&T is bivariate normal, the change in the portfolio 
is normally distributed. The one-day 99% VaR is therefore 


220,227 X 2.326 = $512,300 


When a period of 10 days is considered, both © y and oy, increase by a multiplicative factor 
of /10. As a result, VaR also increases by a multiplicative factor of /10 to become 
$1,620,100. The 10-day 99% ES is given by equation (12.2) with o = 220,227 4/10, 
Y = 2.326, and X = 0.99. It is $1,856,100. 


14.1.2 The Benefits of Diversification 

In the example we have just considered: 
1. The 10-day 99% VaR for the portfolio of Microsoft shares is $1,471,300. 
2. The 10-day 99% VaR for the portfolio of AT&T shares is $367,800. 


3. The 10-day 99% VaR for the portfolio of both Microsoft and AT&T shares is 
$1,620,100. 


The amount 
($1,471,300 + $367,800) — $1,620,100 = $219,000 


represents the benefits of diversification. If Microsoft and AT&T were perfectly corre- 
lated, the VaR for the portfolio of both Microsoft and AT&T would equal the VaR 
for the Microsoft portfolio plus the VaR for the AT&T portfolio. Less than perfect 
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correlation leads to some of the risk being “diversified away.” This is true of ES as well 
as VaR? 


14.2 Generalization 


The examples we have just considered are simple illustrations of the use of the linear 
model for calculating VaR. Suppose that we have a portfolio worth P that is dependent 
on n market variables. Following the market’s usual terminology, we will refer to market 
variables as risk factors. Examples of risk factors are equity prices, commodity prices, or 
exchange rates. (We defer consideration of interest rates, credit spreads, and volatilities 
until later in this chapter.) We suppose that, to a good approximation, the change in the 
value of the portfolio is linearly related to proportional changes in the risk factors so 
that 


AP = È ô; Ax, (14.1) 


i=1 


where AP is the dollar change in the value of the whole portfolio in one day and Ax; is 
the proportional change in the ith risk factor in one day. 

The parameter, 6,, is a variation on the delta risk measure explained in Chapter 8. 
The delta of a position with respect to a risk factor is normally defined as the ratio 
AP/AS where AS is a small change in the value of the risk factor (with all other risk 
factors remaining the same) and AP is the resultant change in the value of the portfolio. 
The parameter, 5;, we use here is AP/Ax; where Ax; is a small percentage change in the 
value of the risk factor (again with all other risk factors remaining the same) and AP is 
the resultant dollar change in the value of the portfolio. 

If we assume that the Ax; in equation (14.1) is multivariate normal, AP is normally 
distributed. To calculate VaR, we therefore need to calculate only the mean and standard 
deviation of AP. We assume, as discussed in the previous section, that the expected value 
of each Ax, is zero. This implies that the mean of AP is zero. 

The way the standard deviation of AP is calculated is an extension of the two-asset 
example in the previous section. We define 6; as the standard deviation of Ax;, and p; 
is the coefficient of correlation between Ax; and Ax,. (Because we are considering one 
day, 6; is the volatility per day.) The standard deviation of AP is given by 


(14.2) 


3 As discussed in Section 12.5, VaR does not always reflect the benefits of diversification. For non- 
normal distributions, the VaR of two portfolios considered jointly can be greater than the sum of their 
VaRs. ES does not have this disadvantage. 
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This equation can also be written as 


Op = 


> 5207 + > P5566; 


ifj 


or as 


2 5&0? +2 >» P5;5;6,0, 


j<i 


Yet another way of writing this key result is 


-> y cov, 5,8; (14.3) 


i=1 j= 


where cov; is the covariance between Ax; and Ax;,. Using matrix notation, this becomes 


o2 T 
o2 =8"Cő 


where 6 is the (column) vector whose ith element is 6;, C is the variance-covariance 
matrix (see Section 11.3), and ô” is the transpose of 6. 


The standard deviation of the change over T a is © ev T, and the VaR for a T-day 


time horizon and an X% confidence level is N~!(X)o X)opVT. T. The ES fora T-day horizon 
and an X% confidence level is, from equation hen 


2 
arp 


V2n(1 — X) 


where Y = N7'(X). In these results, NT! is the inverse cumulative normal distribution 
function (NORMSINV in Excel). 


oy Tt 


Example 14.1 


In the example considered in the previous section, the portfolio, P, had $10 million 
invested in the first asset (Microsoft) and $5 million invested in the second asset (AT&T). 
In this case, when amounts are measured in millions of dollars, 5, = 10,5, = 5, and 


AP = 10 Ax, +5 Ax, 
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Also, 6, = 0.02,6, = 0.01, and p,, = 0.3 so that 
o% = 10° x 0.027 + 5° x 0.017 +2 x 10 x 5 x 0.3 x 0.02 x 0.01 = 0.0485 


and op = 0.220. This is the standard deviation of the change in the portfolio value per day 


(in millions of dollars). The 10-day 99% VaR is 2.326 x 0.220 x y 10 = $1.62 million. 
The 10-day ES is: 


—2,3262/2) 


0.220 x yV10-——— e 


2m X 0.01 


or $1.86 million. These results agree with the calculations in the previous section. 


14.2.1 Relation to Markowitz 


The portfolio return in one day is AP/P. From equation (14.2), the variance of this is 


n n 


5 2 PjwiW; 0;0}; 


i=1 j=1 


where w; = 6,/P. When the portfolio consists of long and short positions in n assets with 
the ith risk factor being the value of the ith asset, 6; is the value of the investment in the 
ith asset (as shown in Example 14.1) and w; is therefore the proportion of the portfolio 
invested in the ith asset. This version of equation (14.2) corresponds to the work of 
Markowitz and is often used by portfolio managers to relate the standard deviation of 
the return on a portfolio to the standard deviation of the returns on the components 
of the portfolio and the correlations between the returns on the components of the 
portfolio. (See Section 1.1.) 


14.3 The Four-Index Example Revisited 


As an example of the application of equation (14.2) or (14.3), return to the example 
considered in Section 13.1. This involved a portfolio on September 25, 2008, con- 
sisting of a $4 million investment in the Dow Jones Industrial Average, a $3 million 
investment in the FTSE 100, a $1 million investment in the CAC 40, and a $2 million 
investment in the Nikkei 225. Daily returns were collected over 500 days ending on 
September 25, 2008. (Data and calculations presented here are on the author’s website: 
www-2.rotman.utoronto.ca/~hull/R MFI/VaR Example.) 
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Table 14.1 Correlation Matrix on September 25, 2008, Calculated by Giving 
the Same Weight to Each of the Last 500 Daily Returns (variable 1 is DJIA; 
variable 2 is FTSE 100; variable 3 is CAC 40; variable 4 is Nikkei 225) 


1 0.489 0.496 —0.062 
0.489 1 0.918 0.201 
0.496 0.918 1 0.211 

—0.062 0.201 0.211 1 


The correlation matrix, when calculations are carried out in the usual way with the 
same weight for all 500 returns, is shown in Table 14.1. The FTSE 100 and CAC 40 are 
very highly correlated. The Dow Jones Industrial Average (DJIA) is moderately highly 
correlated with both the FTSE 100 and the CAC 40. The correlation of the Nikkei 225 
with other indices is less high (and even negative with the DJIA). The covariance matrix 
is shown in Table 14.2. 

From equation (14.3), this variance-covariance matrix gives the variance of the port- 
folio losses ($000s) as 8,761.833. The standard deviation is the square root of this, or 93.60. 
The one-day 99% VaR is therefore 


2.326 X 93.60 = 217.757 


or $217,757, and the one-day 99% ES is 


e2326 /2 
93.60 X ——————-. = 249.476 
y 2r x 0.01 


or $249,476. This compares with a VaR of $253,385 and ES of $327,181, calculated 
using the basic historical simulation approach in Chapter 13. 


14.3.1 Use of EWMA 


Instead of calculating variances and covariances by giving equal weight to all observed 
returns, we now use the exponentially weighted moving average (EWMA) method with 


Table 14.2 Covariance Matrix on September 25, 2008, Calculated by Giving 
the Same Weight to Each of the Last 500 Daily Returns (variable 1 is DJIA; 
variable 2 is FTSE 100; variable 3 is CAC 40; variable 4 is Nikkei 225) 


0.0001227 0.0000768 0.0000767 —0.0000095 
0.0000768 0.0002010 0.0001817 0.0000394 
0.0000767 0.0001817 0.0001950 0.0000407 
—0.0000095 0.0000394 0.0000407 0.0001909 
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Table 14.3 Covariance Matrix on September 25, 2008, When EWMA with 
A = 0.94 Is Used (variable 1 is DJIA; variable 2 is FTSE 100; variable 3 is CAC 
40; variable 4 is Nikkei 225) 


0.0004801 0.0004303 0.0004257 —0.0000396 
0.0004303 0.0010314 0.0009630 0.0002095 
0.0004257 0.0009630  0.0009535 0.000168 1 
—0.0000396 0.0002095 0.0001681 0.0002541 


Table 14.4 Volatilities (% per day) on September 25, 2008, for the Equal 
Weighting and EWMA Approaches 


DJIA FTSE 100 CAC 40 Nikkei 225 
Equal weighting 1.11 1.42 1.40 1.38 
EWMA 2.19 3.21 3.09 1.59 


Table 14.5 Correlation Matrix on September 25, 2008, When EWMA 
Method Is Used (variable 1 is DJIA; variable 2 is FTSE 100; variable 3 is CAC 
40; variable 4 is Nikkei 225) 


1 0.611 0.629 —0.113 
0.611 1 0.971 0.409 
0.629 0.971 1 0.342 

—0.113 0.409 0.342 1 


à = 0.94. This gives the variance-covariance matrix in Table 14.3.4 From equation (14.3), 
the variance of portfolio losses ($000s) is 40,995.765. The standard deviation is the square 
root of this, or 202.474. The one-day 99% VaR is therefore 


2.326 X 202.474 = 471.025 


or $471,025, and the one-day ES is, from equation (12.2), $539,637. These are more than 
twice as high as the values given when returns are equally weighted. Tables 14.4 and 14.5 
show the reasons. The standard deviation of the value of a portfolio consisting of long 
positions in securities increases with the standard deviations of security returns and also 
with the correlations between security returns. Table 14.4 shows that the estimated daily 
standard deviations are much higher when EWMA is used than when data are equally 
weighted. This is because volatilities were much higher during the period immediately 
preceding September 25, 2008, than during the rest of the 500 days covered by the data. 
Comparing Table 14.5 with Table 14.3, we see that correlations had also increased? 


*In the EWMA calculations, the variance is initially set equal to the population variance. But all 
reasonable starting variances give similar results because in this case all we are interested in is the final 
variance. 

5 This is an example of the phenomenon that correlations tend to increase in stressed market conditions. 
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14.4 Handling Term Structures 


Variables such as interest rates, credit spreads, and at-the-money volatilities are described 
by term structures. The term structure shows the value of the variable as a function of 
time to maturity. In the case of an interest rate, the term structure shows the zero-coupon 
interest rate as a function of its maturity. (See Appendix B for a discussion of how zero- 
coupon interest rates are calculated.) In the case of a credit spread, the term structure 
shows the credit spread applicable to a bond or credit default swap as a function of its 
maturity. In the case of at-the-money volatilities, the term structure shows the volatility 
used to price an at-the-money option as a function of its maturity° 

Term structures create complications for the model-building approach. Consider, for 
example, a term structure for a particular interest rate. One instrument in a company’s 
portfolio might lead to a cash flow in 3.32 years so that the change in the value of 
the instrument over the next short period of time depends on what will happen to the 
3.32-year maturity point on the term structure; another instrument might lead to a cash 
flow at the 4.48-year point so that the financial institution also has an exposure to that 
point on the term structure. The 3.32-year rate and the 4.48-year rate will tend to move 
together, but they are not perfectly correlated. 

It is clearly not possible to consider separately every single maturity to which the 
financial institution is exposed. We will consider two alternative ways of handling term 
structure risk: principal components analysis and what we will refer to as the “multiple- 
vertices approach.” In the case of variables such as equity prices, commodity prices, 
and exchange rates, we considered proportional changes in the value of the variable, 
and the o-parameters were standard deviations of proportional changes (i-e., volatili- 
ties). For the variables we now consider, it is more usual to consider actual changes 
rather than proportional changes. This is what we do in both of the approaches we 
present. 


14.4.1 Principal Components Analysis 


We discussed the nature of a principal components analysis in Section 9.7. The first two 
or three principal components account for most of the variation in a term structure that 
is observed in practice. The first component is usually a roughly parallel movement in 
the term structure; the second is a change in the slope of the term structure; the third is 
a “bowing” where the curvature of the term structure changes. 

One approach to handling term structures is to assume that the change in a term 
structure in one day arises from only the first two or three principal components. We 
illustrate the calculations using an interest rate example from Chapter 9. The data in 
Table 14.6 provide the sensitivities of a portfolio to rate moves. The factor loadings and 
standard deviation of factor scores are in Tables 14.7 and 14.8. (These tables also appear 


é As we explain later, plain vanilla options prices are approximately linearly dependent on volatility, so 
their dependence on volatility can be handled using equation (14.2). 
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Table 14.6 Change in Portfolio Value for a One-Basis-Point Rate Move ($ millions) 
3-Year Rate 4-Year Rate 5-Year Rate 7-Year Rate 10-Year Rate 


+10 +4 -8 <7 +2 


Table 14.7 Factor Loadings for Swap Data 
PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 


1-year 0.216 —0.501 0.627 —0.487 0.122 0.237 0.011 —0.034 
2-year 0.331 —0.429 0.129 0.354 —0.212 —0.674 —0.100 0.236 
3-year 0.372 —0.267 —0.157 0.414 —0.096 0.311 0.413 —0.564 
4-year 0.392 —0.110 —0.256 0.174 —0.019 0.551 —0.416 0.512 
5-year 0.404 0.019 —0.355 —0.269 0.595 —0.278 —0.316 —0.327 
7-year 0.394 0.194 —0.195 —0.336 0.007 —0.100 0.685 0.422 
10-year 0.376 0.371 0.068 —0.305 —0.684 —0.039 —0.278 —0.279 
30-year 0.305 0.554 0.575 0.398 0.331 0.022 0.007 0.032 


as Tables 9.6 and 9.7.) We assume that the first two principal components describe rate 
moves. The exposure to the first principal component for these data is 


10 X 0.372 + 4 x 0.392 — 8 x 0.404 — 7 x 0.394 + 2 x 0.376 = +0.05 
and the exposure to the second factor is 
10 x (—0.267) + 4 x (—0.110) — 8 x 0.019 — 7 x 0.194 + 2 x 0.371 = —3.88 


Both exposures are measured in millions of dollars per basis point. 

Suppose that f; and fọ are the factor scores for the first two principal components. 
These can be thought of as the amount of first and second principal components in a 
daily change. The change in the portfolio value ($ millions) in one day as a result of the 
first two principal components is 


AP = 0.05f, — 3.88f, 


The factor scores in a principal components analysis are uncorrelated. From Table 14.8, 
the standard deviations of the first two factor scores ($ millions) are 17.55 and 4.77. The 
standard deviation of AP ($ millions) is, therefore, 


V 0.052 X 17.552 + 3.882 x 4.772 = 18.52 


Assuming the factor scores are normally distributed, the one-day 99% VaR is, therefore, 
$18.52 x 2.326 = $43.08 million. 


Table 14.8 Standard Deviation of Factor Scores 
PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 


17.55 4.77 2.08 1.29 0.91 0.73 0.56 0.53 
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Note that the portfolio we are considering has very little exposure to the first factor 
and significant exposure to the second factor. Using only one factor would significantly 
understate VaR. (See Problem 14.9.) 

The example we have just considered has exposures to only the maturities in the 
table of factor loadings, Table 9.7. Exposures to other maturities can be handled using 
interpolation. For example, the 3.5-year rate would be assumed to have a PC1 factor 
loading of 0.382, a PC2 factor loading of —0.1885, and so on. 


14.4.2 The Multiple Vertices Approach 


Derivatives dealers and their regulators like to consider multiple points on a term 
structure when risk measures are calculated. Again we illustrate this for a term structure 
of interest rates. Suppose that the term structure is defined by points corresponding to 
the maturities: 3 months, 6 months, 1 year, 2 years, 3 years, 5 years, 10 years, 15 years, 
20 years, and 30 years. The term structure is a piecewise-linear curve joining the points 
as explained in Appendix B. A delta is calculated for each point on the term structure by 
shifting that point by one basis point while keeping the others unchanged. We will refer 
to deltas defined in this way as “node deltas.” (To use the terminology of Chapter 9, they 
are “cash partial durations.”) Figure 14.1 indicates the term structure change when the 
five-year node delta is calculated. The deltas with respect to other points are calculated 
similarly. When the delta for the shortest maturity point (three months) is calculated, 
all rates less than the shortest maturity are increased by one basis point; when the delta 
with respect to the longest maturity point (30 years) is calculated, all rates greater than 
the longest maturity are increased by one basis point. (This is consistent with the usual 
convention for constructing the term structures described in Appendix B.) The result 
of all this can be seen to be that the sum of the node deltas is DVO1 (which is the effect 
of a parallel one-basis-point shift in the whole term structure). Node deltas are therefore 
a way of dividing the DV01 into 10 component parts. 


Rate (%) 


Maturity (years) 


0 5 10 15 20 25 30 30 


Figure 14.1 Five-Year Rate Is Changed with Other Rates Kept Constant 
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Consider a portfolio whose value depends on only one term structure. Define ô; 
as the ith node delta. This is the increase in the value of the portfolio when there is a 
one-basis-point increase in the interest rate corresponding to the ith maturity. If ©; is 
the standard deviation of a one-day change in the rate corresponding to the ith node 
(measured in basis points) and p; is the correlation between movements in rates at the 
ith and jth nodes, the standard deviation of the change in the value of the portfolio in 
one day is 


(14.4) 


Any given future cash flow has a delta with respect to the two adjacent maturities. 
For example, a cash flow in 3.5 years has a delta with respect to the 3-year rate and the 
5-year rate, but none with respect to the other rates. When the impact of a one-basis- 
point change in the 3.5-year rate is X, there is a delta of 0.75X for the 3-year node and 
0.25.X for the 5-year node. 

Note that equation (14.4) is the same as equation (14.2). The only difference is 
that equation (14.2) works with proportional changes in the risk factors (the 6-variables 
measure the impact on the portfolio of a proportional changes, and the o-variables are 
volatilities), whereas equation (14.4) works with actual changes in the risk factors (the 
-variables measure the impact of actual changes in the risk factors and the o-variables 
are standard deviations). Equations (14.2) and (14.4) can be extended to cover a portfolio 
where proportional changes are considered for some risk factors and actual changes are 
considered for others. 

Suppose now that portfolio P depends on K term structures. Define 6, Oig, and Pj, 
as the values of 5;,6;, and p; for term structure k (1 < k < K). Also define 


N, N, 


2 
Ve = DD, Pind une 


i=1 j=1 


N, 
U, = >, Ök ik 
i=1 


where N, is the number of vertices used for term structure k. 

It is usual to describe the correlation between the two term structures with a single 
parameter. Suppose that p(k,,k,) is the correlation between term structures kų and k}. 
Two ways of defining it are as follows: 


1. The correlation between rate i for term structure k, and rate j for term structure k, 
is P(k,, k>) for all i and j. 

2. The change in the value of the portfolio due to movements in term structure k, and 
the change in the value of the portfolio due to movements in term structure k, have 
a correlation of p(k}, k>). 
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The first definition leads to 


op= | V2 + DY. pik ks) Uy, U, (14.5) 
k kik, 
whereas the second gives 
op= [V2 + DY pkp ke) Va, Ve, (14.6) 
k 


Ri #k, 


Example 14.2 


Suppose that a portfolio has exposures to two different interest rate term structures. The 
standard deviations of rate moves per day and the delta exposures to rate moves are shown 
in Table 14.9. For example, the standard deviation of daily changes in the two-year rate 
in term structure 1 is 5.6 basis points (0.056%), while that for term structure 2 is 11.4 
basis points. The impact of a one-basis-point change in the two-year rate is $85 million 
for term structure 1 and $65 million for term structure 2. We assume that correlations 
P; are the same for both term structures and are those shown in Table 14.10. Thus the 
correlation between the change in the one-year rate in term structure 1 and the change 
in the two-year rate in term structure 1 is 0.92. The same is true for term structure 2. 


Table 14.9 Standard Deviations (in basis points) and Delta Exposures (in millions of dollars per 
basis point) 


3m 6m lyr 2yr 3yr 5yr 10yr 15yr 20yr 30yr 


TS1 SD per day 8.8 7.4 6.7 5.6 5.4 5.4 5.2 5.2 289 6.4 
TS2 SD per day 10.2 10.8 12.0 114 11.0 114 10.0 11.2 11.2 113 
Delta TS1 55 65 80 85 90 70 65 40 20 5 
Delta TS2 85 75 70 65 50 45 40 30 20 20 


Table 14.10 Correlations between Rates of Different Maturities 


3m 6m lyr 2yr 3yr 5yr 10yr 15yr 20yr 30yr 


3m 1.00 0.78 0.62 0.50 0.44 0.36 0.27 0.20 0.17 0.13 
6m 0.78 1.00 0.84 0.74 0.67 0.57 0.44 0.37 0.35 0.30 
lyr 0.62 0.84 1.00 0.92 0.86 0.76 0.63 0.55 0.53 0.47 
2yr 0.50 0.74 0.92 1.00 0.98 0.89 0.75 0.69 0.66 0.60 
3yr 0.44 0.67 0.86 0.98 1.00 0.96 0.83 0.78 0.75 0.69 
5yr 0.36 0.57 0.76 0.89 0.96 1.00 0.92 0.89 0.86 0.81 
10yr 0.27 0.44 0.63 0.75 0.83 0.92 1.00 0.98 0.96 0.93 
15yr 0.20 0.37 0.55 0.69 0.78 0.89 0.98 1.00 0.99 0.97 
20yr 0.17 0.35 0.53 0.66 0.75 0.86 0.96 0.99 1.00 0.99 
30yr 0.13 0.30 0.47 0.60 0.69 0.81 0.93 0.97 0.99 1.00 
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Finally, we assume that the correlation parameter between the two term structures is 0.4. 
In millions of dollars, the results are: 


U, =3,529 U, =5,501 


V, =3,004.9 V, = 4,604.3 


The portfolio standard deviation per day given by equation (14.5) is 6,163.9, and the 
portfolio standard deviation given by equation (14.6) is 5,980.2. These must be multiplied 
by v10 and 2.326 to get the 10-day 99% VaR. The expected shortfall can be calculated 
using equation (12.2). The calculations are shown on the author’s website: 


www-2.rotman.utoronto.ca/~hull/riskman. 


14.5 Extensions of the Basic Procedure 


In this section we continue to consider the situation where a portfolio is linearly depen- 
dent on the underlying risk factors and explain some extensions of the model in equations 
(14.2) and (14.4). 


14.5.1 Stressed Measures 


Suppose we are interested in stressed VaR or stressed ES. As discussed in Section 13.1.3, 
this is an estimate based on data from a stressed period in the past, not from a recent period. 
The methods we have proposed so far in this chapter can be modified by estimating 
volatilities and correlations using data from a stressed period rather than from a recent 
period. 


14.5.2 Non-Normal Distributions 


In Section 12.9 we explained that an approximate formula for aggregating the VaRs for 
n portfolios is 


n n 


DD, PaVaR; VaR; 


i=1 j=l 


VaR total = (14.7) 


total 


where VaR; is the VaR for the ith portfolio, VaR ot 18 the total VaR, and p; is the 
correlation between losses from the ith and jth portfolios. The formula shows that the 
method for aggregating standard deviations can also, as an approximation, be used to 
aggregate percentiles. 
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The result suggests a way the model-building approach can be extended to allow 
for return distributions being non-normal. Suppose that, based on historical data, we 
estimate that the one-percentile point of the 10-day return distribution for Microsoft is 
—17% so that the 10-day 99% VaR of $10 million invested in Microsoft is $1,700,000 
(rather than the $1,471,300 calculated assuming normally distributed returns in Section 
14.1). Suppose further that we estimate that the one-percentile point of the 10-day return 
distribution for AT&T is —10% so that the 10-day 99% VaR for $5 million invested in 
AT&T is $500,000 (rather than the $367,800 estimated assuming normally distributed 
returns). Using equation (14.7) we would adjust our estimate of the VaR of the portfolio 
of Microsoft and AT&T to 


y 1,700,000? + 500,000? + 2 x 0.3 x 1,700,000 x 500,000 = 1,910,500 


14.6 Risk Weights and Weighted Sensitivities 


When equations such as (14.2) and (14.4) are used to calculate VaR or ES, the standard 
deviation of the daily change in the portfolio is multiplied by a constant. For example, 


when the 10-day 99% VaR is calculated, it is multiplied by N~!(0.99) x 10 = 7.36. 
When the 20-day 97.5% ES is calculated, it is multiplied by 


pce (0.99) /2} _ P 


/2n x 0.01 


Suppose that B is the multiplier so that: 


n n 


Risk Measure = B a by 5;5;P9,9; 
i=1 j=1 


Defining W, = Bo;. This can be written: 


Risk Measure = (14.8) 


The W-parameters are referred to as risk weights, and the 6-parameters are referred 
to as weighted sensitivities. 
As we shall see in later chapters, this formulation is used for 


1. Determining capital for market risk in the Fundamental Review of the Trading Book 
using the standardized approach 
2. Determining initial margin in bilaterally cleared transactions 
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14.7 Handling Non-Linearity 


We now consider the situation where the change in the value of a portfolio cannot 
reasonably be considered to be linearly dependent on changes in the underlying risk 
factors. This would be the case for a portfolio containing options. Non-linear portfolios 
have vega and gamma exposure. Most portfolios are to a reasonable approximation linear 
in volatility. The vega exposure can therefore be taken into account as described earlier 
by considering potential movements in the volatility term structure. 

Gamma exposure is more difficult because it gives rise to a quadratic term. Consider a 
portfolio consisting of a single option on an asset with price S. A Taylor series expansion 
shows that it is approximately true that 


oP oP 10°P 
AP = — AS + —Ao + =~-—(AS 14. 
Os * 36 ot Fas! j (ea 


The final term in this expansion cannot be ignored when VaR or ES is being calculated. 
Figure 14.2 illustrates this by showing the relationship between the value of a long call 
option and the price of the underlying asset. A long call is an example of an option 
position with positive gamma. The figure shows that, when the probability distribution 
for the price of the underlying asset at the end of one day is normal, the probability 
distribution for the option price is positively skewed. Figure 14.3 shows the relationship 
between the value of a short call position and the price of the underlying asset. A short 
call position has a negative gamma. In this case, we see that a normal distribution for the 


A Value of 
long call 


SS ee ee Price of 
==- underlying asset 


Figure 14.2 Translation of a Normal Probability Distribution for an Asset into a Probability 
Distribution for Value of a Long Call on the Asset 
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Figure 14.3 Translation of a Normal Probability Distribution for an Asset into a Probability 
Distribution for Value of a Short Call on the Asset 


price of the underlying asset at the end of one day gets mapped into a negatively skewed 
distribution for the value of the option position. 

The VaR for a portfolio is critically dependent on the left tail of the probability 
distribution of the portfolio value. For example, when the confidence level used is 99%, 
the VaR is the value in the left tail below which only 1% of the distribution resides. As 
indicated in Figure 14.2, a positive gamma portfolio tends to have a less heavy left tail 
than the normal distribution. If the distribution is assumed to be normal, the calculated 
VaR will tend to be too high. Similarly, as indicated in Figure 14.3, a negative gamma 
portfolio tends to have a heavier left tail than the normal distribution. If the distribution 
is assumed to be normal, the calculated VaR will tend to be too low. 

For a portfolio with n underlying risk factors, equation (14.1) becomes 


n n 


AP = 5 EDID si Ax;Ax; (14.10) 
i=1 


i=1 j=l 
where y; is a “cross gamma” defined as 


_ oP 
Ox,Ox; 


Vij 


If each instrument in the portfolio depends on only one risk factor, there are no cross 
gammas so that y; = 0 except when i = j. 
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14.7.1 Monte Carlo Simulation 


One way of handling non-linearity is to use Monte Carlo simulation to generate the 
probability distribution for AP. Suppose we wish to calculate a one-day VaR for a port- 
folio. The procedure is as follows: 


1. Value the portfolio today in the usual way using the current values of risk factors. 

2. Sample once from the multivariate normal probability distribution of the Ax,” 

3. Use the sampled values of the Ax; to determine the value of each risk factor at the 
end of one day. 

4. Revalue the portfolio at the end of the day in the usual way. 

5. Subtract the value calculated in step 1 from the value in step 4 to determine a sample 
AP. 

6. Repeat steps 2 to 5 many times to build up a probability distribution for AP. 


VaR or ES can then be calculated from the probability distribution of AP. Sup- 
pose, for example, that we calculate 5,000 different sample values of AP in the way just 
described. The one-day 99% VaR is the value of AP for the 50th worst outcome, the 
one-day 99% ES is average of the outcomes worse than this, and so on The N-day VaR 


is usually assumed to be the one-day VaR multiplied by VN 

The drawback of Monte Carlo simulation is that it tends to be computationally slow 
because a company’s complete portfolio (which might consist of hundreds of thousands 
of different instruments) has to be revalued many times. One way of speeding things up is 
to assume that equation (14.10) describes the relationship between AP and the Ax,. (This 
is similar to what is proposed for the historical simulation approach in Section 13.4.) We 
can then jump straight from step 2 to step 5 in the Monte Carlo simulation and avoid 
the need for a complete revaluation of the portfolio. This is sometimes referred to as the 
partial simulation approach. 


14.7.2 Extension 


An attraction of Monte Carlo simulation is that we do not have to assume the risk 
factors are normally distributed. We can assume any distribution we like for the risk 
factors and use a multivariate copula to define the correlations between variables.!° If we 
use a Gaussian copula in conjunction with non-normal unconditional distributions, we 


7 One way of doing so is given in Section 11.4.1. 

8 As in the case of historical simulation, extreme value theory can be used to “smooth the tails” so that 
better estimates of extreme percentiles are obtained. 

? This is only approximately true when the portfolio includes options, but it is the assumption that is 
made in practice for most VaR calculation methods. 

10 See J. Hull and A. White, “Value at Risk When Daily Changes Are Not Normally Distributed,” 
Journal of Derivatives 5, no. 3 (Spring 1998): 9-19. 
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can follow the five steps given earlier except that step 2 is changed and a step is inserted 
between step 2 and step 3 as follows: 


2. Sample variables u; from the multivariate normal probability distribution. 
2a. Transform each u; to Ax, on a percentile-to-percentile basis. 


14.7.3 Cornish—Fisher Expansions 


A result in statistics known as the Cornish—Fisher expansion can be used to estimate 
percentiles of a probability distribution from its moments. Define up and Op as the mean 
and standard deviation of AP so that 


Hp = E(AP) 


o2 = E[(AP)] — [E(AP)? 


2 
P 
The skewness of a probability distribution is related to the third moment. Positive skew- 
ness indicates that the right tail of the distribution is heavier than the left tail. Negative 
skewness indicates the reverse. The skewness of the probability distribution of AP, &p, is 
usually defined as 


Ep = LEKAP — pp)’ 
OD 


The skewness of a normal distribution is zero. 

The kurtosis of a probability distribution is related to the fourth moment. It measures 
the heaviness of the tails of the distribution. Similarly to skewness, the kurtosis of the 
probability distribution of AP, Kp, is usually defined as 


1 
Kp = —E[(AP — pp)“] 
Op 


The kurtosis of a normal distribution is 3. The excess kurtosis measures the kurtosis relative 
to the normal distribution and is defined as the kurtosis minus 3. 

Using the first three moments of AP, the Cornish—Fisher expansion estimates the 
q-percentile of the distribution of AP as 


Hp + WO p 


where 
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and z, is q-percentile of the standard normal distribution. Accuracy is improved when 
higher moments are used. For example, when kurtosis is considered, the expression for 
w, becomes: 


Example 14.3 


Suppose that for a certain portfolio we calculate pp = —0.2, Op = 2.2, and Ep = —0.4, 


and we are interested in the one percentile (q = 0.01). In this case, z, = —2.326. If we 


assume that the probability distribution of AP is normal, then the one percentile is 
—0.2 = 2.326 X 2.2 = —5.318 
In other words, we are 99% certain that 
AP > —5.318 


When we use the Cornish—Fisher expansion to adjust for skewness and set q = 0.01, we 
obtain 


w, = —2.326 — 1 (2.326? — 1) x 0.4 = —2.620 


so that the one percentile of the distribution is 
—0.2 = 2.620 X 2.2 = —5.965 


Taking account of skewness, therefore, changes the VaR from 5.318 to 5.965. Suppose 
now that we also know that the kurtosis, Kp, is 3.3. This changes our estimate of w, to 


w, = —2.620 + Z (-2.326° + 3 X 2.326) x (3.3 — 3) = 2.691 


so that the estimate of the one percentile becomes 


—0.2 — 2.691 X 2.2 = 6.119 
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14.7.4 Isserlis’ Theorem 


To apply the Cornish—Fisher result, we need to calculate moments of AP. When the Ax, 
are normal, Isserlis’ theorem can in principle be used for this. The theorem states that if 
the X; are zero-mean normal distributions 


BK aK =n 
when n is odd and 
E(X,X,...X,) = )) TE(X;X) 


when n is even. The notation $, IHE(X;X;) describes the result of (a) listing the distinct 
ways in which X}, X>, ... , X, can be partitioned into n/2 pairs; (b) within each partition, 
calculating E(X;X;) for each pair (X, X) and multiplying these expectations together; and 
(c) summing the results across partitions. 

For example, 


E(X1X2X;X4) = E(X,Xp)E(X3X4) + E(X1X)E(X2X4) + E(X X4)E(XX3) 
The expression for E(X,X,X3X,X;X;,) has 15 terms, one of which is 
E(X Xp) E(X3.X4) E(X5Xo) 


Isserlis’ theorem is a potentially attractive result because we know the expectation of 
the product of a pair of zero-mean normally distributed variables, with standard devia- 
tions Oy and oy, is their covariance: E(XY) = po,6y, where p is the correlation between 


them. 
Since the product of n zero-mean normally distributed variables is zero when n is 


odd, equation (14.10) gives 
1 
=3 2, VP iii} 
ij 


E(AP’) = py 5,5,0;6,P; + — ath Ne ElAx Ax AxA) 
tJ, 


E(AP*) = D) 5,5 VE (Ax; Ax; Ax, Ax)) + L 5 VV eV mn EAX; Ax Ax Ax Ax Ax, 
24 kl i j,k,l,m,n 
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When there is only one risk factor so that we can dispense with the subscripts to o, 
p, 5, and y, these equations reduce to: 


E(AP) = Lyo? 


E(AP?) = 0? + “yo! 


E(AP*) = 2 5240 po 
2 8y3%0° 

Unfortunately, the number of terms increases quickly as the number of risk factors 
increases, so that the calculation of the third moment becomes very time-consuming 
for even modest numbers of risk factors. 

One manageable application of Isserlis’ theorem that takes gamma into account is to 
base VaR or ES on the first two moments and assume no cross gamma. This means that 
Yy = 0 when i ¥ j. Simplifying the notation, we set y; = y; to obtain 


E(AP) = L » 1:0; (14.11) 


| 
E(A P’) = 2 5,5,0,6;p; + ri ba YE (axax) 
iy 1J 


From Isserlis’ theorem, 


E(Ax; Ax) = 2[F(Ax,Ax)]° + E(x;) E(x) = 2p7070; +070; 


iji 


It follows that the standard deviation of AP is 


SD(AP) = y E(AP?) — (E(AP)? = >; 5,5;6,0,p, + ; > P5V:¥0; 9; (14.12) 
ij ij 


We will see an application of this when we discuss the Standard Initial Margin Model 
(SIMM) in Chapter 17. 


14.8 Model-Building vs. Historical Simulation 


In the preceding chapter and in this one, we have discussed two methods for estimating 
VaR: the historical simulation approach and the model-building approach. The advan- 
tages of the model-building approach are that results can be produced very quickly, 
and they can easily be used in conjunction with volatility and correlation updating 
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procedures such as those described in Chapters 10 and 11. (As mentioned in Section 
13.3, volatility changes can be incorporated into the historical simulation approach—but 
in a rather more artificial way.) The main disadvantage of the model-building approach 
is that quick results can be produced only when the change in the value of the port- 
folio is linearly related to the proportional or actual changes in the risk factors and the 
daily changes in the risk factors are assumed to have a multivariate normal distribution. 
In practice, daily changes in risk factors often have distributions that are quite different 
from normal. (See, for example, Table 10.1.) A user of the model-building approach can 
then use the approach in Section 14.5.2, hoping that some form of the central limit 
theorem of statistics applies so that the probability distribution of daily gains/losses on a 
large portfolio is normally distributed, even though the gains/losses on the component 
parts of the portfolio are not normally distributed. 

The model-building approach is in practice most often used for investment portfo- 
lios. (It is, after all, closely related to the popular Markowitz mean-variance method of 
portfolio analysis.) It is less commonly used for calculating the VaR for the trading oper- 
ations of a financial institution because of the difficulty of taking gamma into account. It 
might be thought that the impact of gamma on VaR or ES is much less than the impact 
of delta. However, this is often not the case because, as explained in Chapter 8, financial 
institutions like to hedge positions so that deltas are close to zero. 


Summary 


Whereas historical simulation lets the data determine the joint probability distribution of 
daily percentage changes in risk factors, the model-building approach assumes a particular 
form for this distribution. The most common assumption is that percentage changes or 
actual changes in the risk factors have a multivariate normal distribution. For situations 
where the change in the value of the portfolio is linearly dependent on changes in the 
risk factors, VaR and ES can then be calculated very quickly. Other situations are more 
difficult to handle. One approach for implementing the model-building approach so that 
any portfolio can be accommodated is Monte Carlo simulation, but then the approach 
is computationally much slower. 

The model-building approach is frequently used for investment portfolios. It is less 
popular for the trading portfolios of financial institutions because of the difficulty in 
modeling gamma. 
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Practice Questions and Problems 
(Answers at End of Book) 


Consider a position consisting of a $100,000 investment in asset A and a $100,000 
investment in asset B. Assume that the daily volatilities of both assets are 1% and 
that the coefficient of correlation between their returns is 0.3. What are the five- 
day 97% VaR and ES for the portfolio? 

Describe two ways of handling interest-rate-dependent instruments when the 
model-building approach is used to calculate VaR. 

Suppose that the value ofa portfolio increases by $50,000 for each one-basis-point 
increase in the 12-year rate and has no other sensitivities. The multiple-vertex 
approach is used to model with the following vertices: 3 months, 6 months, 1 year, 
2 years, 3 years, 5 years, 10 years, 15 years, 20 years, and 30 years. What is the 
sensitivity of the portfolio to a one-basis-point increase in each vertex of the 
term structure? 

A financial institution owns a portfolio of instruments dependent on the U.S. 
dollar—sterling exchange rate. The delta of the portfolio with respect to percentage 
changes in the exchange rate is 3.9. If the daily volatility of the exchange rate is 
0.7% and a linear model is assumed, estimate the 10-day 99% VaR. 

Suppose that you know the gamma of the portfolio in Problem 14.4 (calculated 
with respect to percentage changes) is 4.3. How does this change your estimate 
of the relationship between the change in the portfolio value and the percentage 
change in the exchange rate? 

A portfolio has exposure to the two-year interest rate and the five-year interest 
rate. A one-basis-point increase in the two-year rate causes the value of the port- 
folio to increase in value by $10,000. A one-basis-point increase in the five-year 
rate causes the value of the portfolio to decrease by $8,000. The standard devi- 
ation per day of the two-year rate and that of the five-year rate are 7 and 8 basis 
points, respectively, and the correlation between the two rates is 0.8. What is the 
portfolio’s expected shortfall when the confidence level is 98% and the time 
horizon is five days? 

Explain how the terms risk weights and risk sensitivity are used in connection with 
the model-building approach. 

Suppose that the daily change in the value of a portfolio is, to a good approxima- 
tion, linearly dependent on two factors, calculated from a principal components 
analysis. The delta of a portfolio with respect to the first factor is 6 and the delta 
with respect to the second factor is —4. The standard deviations of the factor are 
20 and 8, respectively. What is the five-day 90% VaR? 

The text calculates a VaR estimate for the example in Table 14.6 assuming two 
factors. How does the estimate change if you assume (a) one factor and (b) three 
factors? 

A bank has a portfolio of options on an asset. The delta and gamma of the options 
(with respect to actual changes in the asset price) are —30 and —5, respectively. 
Explain how these numbers can be interpreted. What are the delta and gamma 
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14.11 


14.12 


14.13 


14.14 


14.15 


14.16 


14.17 


14.18 


with respect to proportional changes in the asset price? The asset price is 20 and 
its volatility is 1% per day. Using Isserlis’ theorem, calculate the first three moments 
of the change in the portfolio value. Calculate a one-day 99% VaR (a) using the 
first two moments and (b) using the first three moments in conjunction with the 
Cornish—Fisher expansion. 

Suppose that in Problem 14.10 the vega of the portfolio is —2 per 1% change in 
the annual volatility. Derive a model relating the change in the portfolio value in 
one day to delta, gamma, and vega. 

Explain why the linear model can provide only approximate estimates of VaR for 
a portfolio containing options. 

Some time ago, a company entered into a forward contract to buy £1 million for 
$1.5 million. The contract now has six months to maturity. The daily volatility of 
a six-month zero-coupon sterling bond (when its price is translated to dollars) is 
0.06%, and the daily volatility of a six-month zero-coupon dollar bond is 0.05%. 
The correlation between returns from the two bonds is 0.8. The current exchange 
rate is 1.53. Calculate the standard deviation of the change in the dollar value of the 
forward contract in one day. What is the 10-day 99% VaR? Assume that the six- 
month interest rate in both sterling and dollars is 5% per annum with continuous 
compounding. 

The calculations in Section 14.3 assume that the investments in the DJIA, FTSE 
100, CAC 40, and Nikkei 225 are $4 million, $3 million, $1 million, and $2 million, 
respectively. How do the VaR and ES change if the investment is $2.5 million 
in each index? Carry out calculations when (a) volatilities and correlations are 
estimated using the equally weighted model and (b) when they are estimated using 
the EWMA model with à = 0.94. Use the spreadsheets on the author’s website. 
What is the effect of changing A from 0.94 to 0.97 in the EWMA calculations in 
Section 14.3? Use the spreadsheets on the author’s website. 

Explain two alternative ways of defining the correlation between two term struc- 
tures using a single correlation parameter when there are multiple vertices. 


Further Questions 


Consider a position consisting of a $300,000 investment in gold and a $500,000 
investment in silver. Suppose that the daily volatilities of these two assets are 1.8% 
and 1.2%, respectively, and that the coefficient of correlation between their returns 
is 0.6. What are the 10-day 97.5% VaR and ES for the portfolio? By how much 
does diversification reduce the VaR and ES? 

Consider a portfolio of options on a single asset. Suppose that the delta of the 
portfolio (calculated with respect to actual changes) is 12, the value of the asset is 
$10, and the daily volatility of the asset is 2%. What is the delta with respect to 
proportional changes? Estimate the one-day 95% VaR for the portfolio from the 
delta. 
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14.19 


14.20 


14.21 


14.22 


14.23 


Suppose that you know that the gamma of the portfolio in Problem 14.18 (again 
measured with respect to actual changes) is —2.6. What is the gamma with respect 
to proportional changes? Derive a quadratic relationship between the change in 
the portfolio value and the percentage change in the underlying asset price in one 
day. 
(a) Use Isserlis’ theorem to calculate the first three moments of the change in the 
portfolio value. 
(b) Using the first two moments and assuming that the change in the portfolio is 
normally distributed, calculate the one-day 95% VaR for the portfolio. 
(c) Use the third moment and the Cornish—Fisher expansion to revise your answer 
to (b). 
A company has a long position in a two-year bond and a three-year bond, as well 
as a short position in a five-year bond. Each bond has a principal of $100 million 
and pays a 5% coupon annually. Calculate the company’s exposure to the one- 
year, two-year, three-year, four-year, and five-year rates. Use the data in Tables 
14.7 and 14.8 to calculate a 20-day 95% VaR on the assumption that rate changes 
are explained by (a) one factor, (b) two factors, and (c) three factors. Assume that 
the zero-coupon yield curve is flat at 5%. 
A company has a position in bonds worth $6 million. The modified duration of 
the portfolio is 5.2 years. Assume that only parallel shifts in the yield curve can 
take place and that the standard deviation of the daily yield change (when yield 
is measured in percent) is 0.09. Use the duration model to estimate the 20-day 
90% VaR for the portfolio. Explain carefully the weaknesses of this approach to 
calculating VaR. Explain two alternatives that give more accuracy. 
A bank has written a European call option on one stock and a European put 
option on another stock. For the first option, the stock price is 50, the strike price 
is 51, the volatility is 28% per annum, and the time to maturity is nine months. 
For the second option, the stock price is 20, the strike price is 19, the volatility 
is 25% per annum, and the time to maturity is one year. Neither stock pays a 
dividend, the risk-free rate is 6% per annum, and the correlation between stock 
price returns is 0.4. Calculate a 10-day 99% VaR (a) using only deltas, (b) using 
the partial simulation approach, and (c) using the full simulation approach. 
The calculations in Section 14.3 assume that the investments in the DJIA, FTSE 
100, CAC 40, and Nikkei 225 are $4 million, $3 million, $1 million, and $2 million, 
respectively. How do the VaR and ES change if the investments are $3 million, 
$3 million, $1 million, and $3 million, respectively? Carry out calculations when 
(a) volatilities and correlations are estimated using the equally weighted model 
and (b) when they are estimated using the EWMA model. What is the effect of 
changing A from 0.94 to 0.90 in the EWMA calculations? Use the spreadsheets 
on the author’s website. 


Part Three 


REGULATION 


Chapter 15 


Basel I, Basel II, and 
Solvency II 


n agreement in 1988, known as the Basel Accord, marked the start of interna- 
A tional standards for bank regulation. Since 1988, bank regulation has been an 

evolutionary process. New regulations have modified previous regulations, but 
often approaches used in previous regulations have been preserved. In order to under- 
stand the current regulatory environment, it is therefore necessary to understand histor- 
ical developments. This chapter explains the evolution of the regulatory environment 
prior to the 2007 credit crisis. Chapters 16 and 17 will cover developments since the 
crisis, and Chapter 18 will cover a planned future development. 

This chapter starts by explaining the 1988 Basel Accord (now known as Basel I), 
netting provisions, and the 1996 Amendment. It then moves on to discuss Basel II, which 
is a major overhaul of the regulations and was implemented by many banks throughout 
the world in about 2007. Finally, it reviews Solvency II,a new regulatory framework for 
insurance companies, which is broadly similar to Basel II and was implemented by the 
European Union in 2016. 


15.1 The Reasons for Regulating Banks 
The main purpose of bank regulation is to ensure that a bank keeps enough capital for the 


risks it takes. It is not possible to eliminate altogether the possibility of a bank failing, but 
governments want to make the probability of default for any given bank very small. By 
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doing this, they hope to create a stable economic environment where private individuals 
and businesses have confidence in the banking system. 

It is tempting to argue: “Bank regulation is unnecessary. Even if there were no reg- 
ulations, banks would manage their risks prudently and would strive to keep a level 
of capital that is commensurate with the risks they are taking.’ Unfortunately, history 
does not support this view. There is little doubt that regulation has played an impor- 
tant role in increasing bank capital and making banks more aware of the risks they are 
taking. 

As discussed in Section 2.3, governments provide deposit insurance programs to pro- 
tect depositors. Without deposit insurance, banks that took excessive risks relative to their 
capital base would find it difficult to attract deposits. However, the impact of deposit 
insurance is to create an environment where depositors are less discriminating. A bank 
can take large risks without losing its deposit base.! The last thing a government wants 
is to create a deposit insurance program that results in banks taking more risks. It is 
therefore essential that deposit insurance be accompanied by regulation concerned with 
capital requirements. 

A major concern of governments is what is known as systemic risk. This is the risk 
that a failure by a large bank will lead to failures by other large banks and a collapse of the 
financial system. The way this can happen is described in Business Snapshot 15.1. When 
a bank or other large financial institution does get into financial difficulties, governments 
have a difficult decision to make. If they allow the financial institution to fail, they are 
putting the financial system at risk. If they bail out the financial institution, they are 
sending the wrong signals to the market. There is a danger that large financial institutions 
will be less vigilant in controlling risks if they know that they are “too big to fail” and 
the government will always bail them out. 

During the market turmoil of 2008, the decision was taken to bail out some financial 
institutions in the United States and Europe. However, Lehman Brothers was allowed to 
fail in September 2008. Possibly, the United States government wanted to make it clear 
to the market that bailouts for large financial institutions were not automatic. However, 
the decision to let Lehman Brothers fail has been criticized because arguably it made the 
credit crisis worse. 


15.2 Bank Regulation Pre-1988 


Prior to 1988, bank regulators within a country tended to regulate bank capital by setting 
minimum levels for the ratio of capital to total assets. However, definitions of capital 
and the ratios considered acceptable varied from country to country. Some countries 
enforced their regulations more diligently than other countries. Increasingly, banks were 


' As mentioned in Chapter 3, this is an example of what insurance companies term moral hazard. The 
existence of an insurance contract changes the behavior of the insured party. 
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BUSINESS SNAPSHOT 15.1 
Systemic Risk 


Systemic risk is the risk that a default by one financial institution will create a 
“ripple effect” that leads to defaults by other financial institutions and threatens 
the stability of the financial system. There are huge numbers of over-the-counter 
transactions between banks. If Bank A fails, Bank B may take a huge loss on the 
transactions it has with Bank A. This in turn could lead to Bank B failing. Bank 
C that has many outstanding transactions with both Bank A and Bank B might 
then take a large loss and experience severe financial difficulties; and so on. 

The financial system has survived defaults such as Drexel in 1990, Barings 
in 1995, and Lehman Brothers in 2008 very well, but regulators continue to be 
concerned. During the market turmoil of 2007 and 2008, many large financial 
institutions were bailed out, rather than being allowed to fail, because govern- 
ments were concerned about systemic risk. 


competing globally and a bank operating in a country where capital regulations were 
slack was considered to have a competitive edge over one operating in a country with 
tighter, more strictly enforced capital regulations. In addition, the huge exposures created 
by loans from the major international banks to less developed countries such as Mexico, 
Brazil, and Argentina, as well as the accounting games sometimes used for those exposures 
(see Business Snapshot 2.3) were starting to raise questions about the adequacy of capital 
levels. 

Another problem was that the types of transactions entered into by banks were 
becoming more complicated. The over-the-counter derivatives market for products such 
as interest rate swaps, currency swaps, and foreign exchange options was growing fast. 
These contracts increase the credit risks being taken by a bank. Consider, for example, 
an interest rate swap. If the counterparty in the interest rate swap transaction defaults 
when the swap has a positive value to the bank and a negative value to the counter- 
party, the bank is liable to lose money. The potential future exposure from derivatives 
was not reflected in the bank’s reported assets. As a result, it had no effect on the level 
of assets reported by a bank and therefore no effect on the amount of capital the bank 
was required to keep. It became apparent to regulators that the value of total assets was 
no longer a good indicator of the total risks being taken. A more sophisticated approach 
than that of setting minimum levels for the ratio of capital to total balance-sheet assets 
was needed. 

The Basel Committee was formed in 1974. The committee consisted of representa- 
tives from Belgium, Canada, France, Germany, Italy, Japan, Luxembourg, the Netherlands, 
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Sweden, Switzerland, the United Kingdom, and the United States. It met regularly in 
Basel, Switzerland, under the patronage of the Bank for International Settlements. The 
first major result of these meetings was a document titled “International Convergence 
of Capital Measurement and Capital Standards.” This was referred to as “The 1988 BIS 
Accord” or just “The Accord.” Later it became known as Basel I. 


15.3 The 1988 BIS Accord 


The 1988 BIS Accord was the first attempt to set international risk-based standards for 
capital adequacy. It has been subject to much criticism as being too simple and somewhat 
arbitrary. In fact, the Accord was a huge achievement. It was signed by all 12 members of 
the Basel Committee and paved the way for significant increases in the resources banks 
devote to measuring, understanding, and managing risks. The key innovation in the 1988 
Accord was the Cooke ratio. 


15.3.1 The Cooke Ratio 


The Cooke ratio” considers credit risk exposures that are both on-balance-sheet and 
off-balance-sheet. It is based on what is known as the bank’s total risk-weighted assets (also 
sometimes referred to as the risk-weighted amount). This is a measure of the bank’s total 
credit exposure. 

Credit risk exposures can be divided into three categories: 


1. Those arising from on-balance sheet assets (excluding derivatives) 
2. Those arising for off-balance sheet items (excluding derivatives) 
3. Those arising from over-the-counter derivatives 


Consider the first category. Each on-balance-sheet asset is assigned a risk weight 
reflecting its credit risk. A sample of the risk weights specified in the Accord is shown 
in Table 15.1. Cash and securities issued by governments of OECD countries (members 
of the Organisation for Economic Co-operation and Development) are considered to 
have virtually zero risk and have a risk weight of zero. Loans to corporations have a risk 
weight of 100%. Loans to banks and government agencies in OECD countries have a 
risk weight of 20%. Uninsured residential mortgages have a risk weight of 50%. The total 
of the risk-weighted assets for N on-balance-sheet items equals 


where L; is the principal amount of the ith item and w is its risk weight. 


? The ratio is named after Peter Cooke, from the Bank of England. 
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Table 15.1 Risk Weights for On-Balance-Sheet Items 
Risk 
Weight (%) Asset Category 


0 Cash, gold bullion, claims on OECD governments such as Treasury bonds or 
insured residential mortgages 
20 Claims on OECD banks and OECD public-sector entities such as securities issued 
by U.S. government agencies or claims on municipalities 
50 Uninsured residential mortgage loans 
100 All other claims such as corporate bonds and less developed country debt, claims on 
non-OECD banks 


Example 15.1 


The assets of a bank consist of $100 million of corporate loans, $10 million of OECD 
government bonds, and $50 million of residential mortgages. The total of the risk- 
weighted assets is 


1.0 X 100 + 0.0 x 10 + 0.5 x 50 = 125 


or $125 million. 


Consider next the second category. This includes bankers’ acceptances, guarantees, 
and loan commitments. A credit equivalent amount is calculated by applying a conver- 
sion factor to the principal amount of the instrument. Instruments that from a credit 
perspective are considered to be similar to loans, such as bankers’ acceptances, have a 
conversion factor of 100%. Others, such as note issuance facilities (where a bank agrees 
that a company can issue short-term paper on pre-agreed terms in the future), have lower 
conversion factors. 

Finally, consider the third category. For an over-the-counter derivative such as an 
interest rate swap or a forward contract, the credit equivalent amount is calculated as 


max(V, 0) + aL (15.1) 


where V is the current value of the derivative to the bank, a is an add-on factor, and L 
is the principal amount. The first term in equation (15.1) is the current exposure. If 
the counterparty defaults today and V is positive, the contract is an asset to the bank 
and the bank is liable to lose V. If the counterparty defaults today and V is negative, 
the contract is an asset to the counterparty and there will be neither a gain nor a loss 
to the bank. The bank’s exposure is therefore max(V, 0). (More details on the way in 
which defaults are handled in the bilaterally cleared over-the-counter derivatives market 
are in Chapters 17 and 20.) The add-on amount, aL, is an allowance for the possibility 
of the exposure increasing in the future. Examples of the add-on factor, a, are shown in 
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Table 15.2 Add-On Factors as a Percent of Principal for Derivatives 


Remaining Interest Exchange Rate Precious Metals Other 
Maturity (yr) Rate and Gold Equity Except Gold Commodities 
<1 0.0 1.0 6.0 7.0 10.0 
1to5 0.5 5.0 8.0 7.0 12.0 
>5 1.5 7.5 10.0 8.0 15.0 


Table 15.2. Equation (15.1) is known as the current exposure method (CEM). The add-on 
factors were revised and extended in the years following 1988. 


Example 15.2 


A bank has entered into a $100 million interest rate swap with a remaining life of four 
years. The current value of the swap is $2.0 million. In this case, the add-on amount 
is 0.5% of the principal so that the credit equivalent amount is $2.0 million plus $0.5 
million or $2.5 million. 


The credit equivalent amount arising from either the second or third category of 
exposures is multiplied by the risk weight for the counterparty in order to calculate the 
risk-weighted assets. The risk weights are similar to those in Table 15.1 except that the 
risk weight for a corporation is 0.5 rather than 1.0. 


Example 15.3 


Consider again the bank in Example 15.2. If the interest rate swap is with a corporation, 
the risk-weighted assets are 2.5 X 0.5 or $1.25 million. If it is with an OECD bank, the 
risk-weighted assets are 2.5 X 0.2 or $0.5 million. 


Putting all this together, the total risk-weighted assets for a bank with N on-balance- 
sheet items and M off-balance-sheet items is 


2 wL, + $ w*C, (15.2) 


Here, L; is the principal of the ith on-balance-sheet asset and w, is the risk weight for the 
asset; C; is the credit equivalent amount for the jth derivative or other off-balance-sheet 
item and we is the risk weight of the counterparty for this jth item. 


15.3.2 Capital Requirement 


The Accord required banks to keep capital equal to at least 8% of the risk-weighted 
assets. The capital had two components: 
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1. Tier 1 Capital. This consists of items such as equity and noncumulative perpetual 
preferred stock? (Goodwill is subtracted from equity) 

2. Tier 2 Capital. This is sometimes referred to as Supplementary Capital. It includes 
instruments such as cumulative perpetual preferred stock, certain types of 99-year 
debenture issues, and subordinated debt (i.e., debt subordinated to depositors) with 
an original life of more than five years. 


Equity capital is the most important type of capital because it absorbs losses. If equity 
capital is greater than losses, a bank can continue as a going concern. If equity capital is 
less than losses, the bank is insolvent. In the latter case, Tier 2 capital becomes relevant. 
Because it is subordinate to depositors, it provides a cushion for depositors. If a bank is 
wound up after its Tier I capital has been used up, losses should be borne first by the 
Tier 2 capital and, only if that is insufficient, by depositors. (See Section 2.2.) 

The Accord required at least 50% of the required capital (that is, 4% of the risk- 
weighted assets) to be in Tier 1. Furthermore, the Accord required 2% of risk-weighted 
assets to be common equity. (Note that the Basel Committee has updated its definition 
of instruments that are eligible for Tier 1 capital and its definition of common equity in 
Basel III.) 

The bank supervisors in some countries require banks to hold more capital than the 
minimum specified by the Basel Committee and some banks themselves have a target 
for the capital they will hold that is higher than that specified by their bank supervisors. 


15.4 The G-30 Policy Recommendations 


In 1993, a working group consisting of end-users, dealers, academics, accountants, and 
lawyers involved in derivatives published a report that contained 20 risk management 
recommendations for dealers and end-users of derivatives and four recommendations 
for legislators, regulators, and supervisors. The report was based on a detailed survey of 
80 dealers and 72 end-users worldwide. The survey involved both questionnaires and 
in-depth interviews. The report is not a regulatory document, but it has been influential 
in the development of risk management practices. 
A brief summary of the important recommendations is as follows: 


1. A company’s policies on risk management should be clearly defined and approved 
by senior management, ideally at the board of directors level. Managers at all levels 
should enforce the policies. 


3 Noncumulative perpetual preferred stock is preferred stock lasting forever where there is a prede- 
termined dividend rate. Unpaid dividends do not cumulate (that is, the dividends for one year are not 
carried forward to the next year). 

* Goodwill is created when one company acquires another. It is the purchase price minus the book 
value of the assets. It represents the intangible assets acquired. 

>In cumulative preferred stock, unpaid dividends cumulate. Any backlog of dividends must be paid 
before dividends are paid on the common stock. 
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2. Derivatives positions should be marked to market (i.e., revalued using a model that 
is consistent with market prices) at least once a day. 

3. Derivatives dealers should measure market risk using a consistent measure such as 
value at risk. Limits to the market risks that are taken should be set. 

4. Derivatives dealers should carry out stress tests to determine potential losses under 
extreme market conditions. 

5. The risk management function should be set up so that it is independent of the 
trading operation. 

6. Credit exposures arising from derivatives trading should be assessed based on the cur- 
rent replacement value of existing positions and potential future replacement costs. 

7. Credit exposures to a counterparty should be aggregated in a way that reflects 
enforceable netting agreements. (We talk about netting in the next section.) 

8. The individuals responsible for setting credit limits should be independent of those 
involved in trading. 

9. Dealers and end-users should assess carefully both the costs and benefits of credit risk 
mitigation techniques such as collateralization and downgrade triggers. In particular, 
they should assess their own capacity and that of their counterparties to meet the 
cash flow requirement of downgrade triggers. (Downgrade triggers are discussed in 
Chapter 20.) 

10. Only individuals with the appropriate skills and experience should be allowed to 
have responsibility for trading derivatives, supervising the trading, carrying out back 
office functions in relation to the trading, and so on. 

11. There should be adequate systems in place for data capture, processing, settlement, 
and management reporting. 

12. Dealers and end-users should account for the derivatives transactions used to manage 
risks so as to achieve a consistency of income recognition treatment between those 
instruments and the risks being managed. 


15.5 Netting 


Participants in the over-the-counter derivatives market have traditionally signed an Inter- 
national Swaps and Derivatives Association (ISDA) master agreement covering their 
derivatives trades. The word netting refers to a clause in the master agreement, which 
states that in the event of a default all transactions are considered as a single transaction. 
Effectively, this means that, if a company defaults on one transaction that is covered by the 
master agreement, it must default on all transactions covered by the master agreement. 
Netting and ISDA master agreements will be discussed in Chapters 17 and 20. At 
this stage, we note that netting can have the effect of substantially reducing credit risk. 
Consider a bank that has three swap transactions outstanding with a particular coun- 
terparty. The transactions are worth +$24 million, -$17 million, and +$8 million to 
the bank. Suppose that the counterparty experiences financial difficulties and defaults 
on its outstanding obligations. To the counterparty the three transactions have values of 
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—$24 million, +$17 million, and —$8 million, respectively. Without netting, the counter- 
party would default on the first transaction, keep the second transaction, and default on 
the third transaction. Assuming no recovery, the loss to the bank would be $32 (= 24 + 8) 
million. With netting, the counterparty is required to default on the second transaction 
as well. The loss to the bank is then $15 (= 24 — 17 + 8) million. 

More generally, suppose that a financial institution has a portfolio of N derivatives 
outstanding with a particular counterparty and that the current value of the ith derivative 
is V;. Without netting, the financial institution’s exposure in the event of a default today is 


» max(V,, 0 
i=1 


N 
max 2 V; o) 
i=1 


Without netting, the exposure is the payoff from a portfolio of options. With netting, 


With netting, it is 


the exposure is the payoff from an option on a portfolio. 

The 1988 Basel Accord did not take netting into account in setting capital require- 
ments. From equation (15.1) the credit equivalent amount for a portfolio of derivatives 
with a counterparty under the Accord was 


N 
>! [max( (V; 0) + a;L;] 
i=1 


where a; is the add-on factor for the ith transaction and L, is the principal for the ith 
transaction. 

By 1995, netting had been successfully tested in the courts in many jurisdictions. As 
a result, the 1988 Accord was modified to allow banks to reduce their credit equivalent 
totals when enforceable bilateral netting agreements were in place. The first step was to 
calculate the net replacement ratio, NRR. This is the ratio of the current exposure with 
netting to the current exposure without netting: 


max(X z Vj, 0) 


NRR = = 
>, max(V,, 0) 


The credit equivalent amount was modified to 


N 
nn (Y v.o) + + (0.4 + 0.6 X NRR) Lal 
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Table 15.3 Portfolio of Derivatives with a Particular Counterparty 


Current 
Transaction Principal, L; Value, V; Add-On Amount, a;L; 
3-year interest rate swap 1,000 —60 5 
6-year foreign exchange forward 1,000 70 75 
9-month option on a stock 500 55 30 


Example 15.4 


Consider the example in Table 15.3, which shows a portfolio of three derivatives that 
a bank has with a particular counterparty. The third column shows the current mark- 
to-market values of the transactions and the fourth column shows the add-on amount 
calculated from Table 15.2. The current exposure with netting is —60 + 70 + 55 = 65. 
The current exposure without netting is 0 + 70 + 55 = 125. The net replacement ratio 
is given by 


NRR = LERS 0.52 
125 


The total of the add-on amounts, )} a;L;, is 5 +75 + 30 = 110. The credit equivalent 
amount when netting agreements are in place is 65 + (0.4 + 0.6 X 0.52) X 110 = 143.32. 
Without netting, the credit equivalent amount is 125 + 110 = 235. Suppose that the 
counterparty is an OECD bank so that the risk weight is 0.2. This means that the 
risk-weighted assets with netting is 0.2 x 143.32 = 28.66. Without netting, it is 


0.2 X 235 = 47. 


15.6 The 1996 Amendment 


In 1995, the Basel Committee issued a consultative proposal to amend the 1988 Accord. 
This became known as the “1996 Amendment.” It was implemented in 1998 and was 
then sometimes referred to as “BIS 98”? The amendment involves keeping capital for the 
market risks associated with trading activities. 

Marking to market is the practice of revaluing assets and liabilities daily using a model 
that is calibrated to current market prices. It is also known as fair value accounting. Banks are 
required to use fair value accounting for all assets and liabilities that are held for trading 
purposes. This includes most derivatives, marketable equity securities, foreign currencies, 
and commodities. These items constitute what is referred to as the bank’s trading book. 

Under the 1996 Amendment, the credit risk capital charge in the 1988 Accord con- 
tinued to apply to all on-balance-sheet and off-balance-sheet items in the trading and 
banking book, except positions in the trading book that consisted of (a) debt and equity 
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traded securities and (b) positions in commodities and foreign exchange. The Amend- 
ment introduced a capital charge for the market risk associated with all items in the 
trading book 

The 1996 Amendment outlined a standardized approach for measuring the capital 
charge for market risk. The standardized approach assigned capital separately to each of 
debt securities, equity securities, foreign exchange risk, commodities risk, and options. 
No account was taken of correlations between different types of instruments. The more 
sophisticated banks with well-established risk management functions were allowed to 
use an “internal model-based approach” for setting market risk capital. This involved 
calculating a value-at-risk measure and converting it into a capital requirement using a 
formula specified in the 1996 Amendment. Most large banks preferred to use the internal 
model-based approach because it better reflected the benefits of diversification and led 
to lower capital requirements. 

The value-at-risk measure used in the internal model-based approach was calculated 
with a 10-day time horizon and a 99% confidence level. It is the loss that has a 1% chance 
of being exceeded over a 10-day period. The capital requirement is 


max(VaR,_,,m, X VaR ivo) + SRC (15.3) 


ie) 
where m, is a multiplicative factor, and SRC is a specific risk charge. The variable VaR,_, 


is the previous day’s value at risk and VaR „e is the average value at risk over the past 


av! 
60 days. The minimum value for m, is 3. Figher values may be chosen by regulators for 
a particular bank if tests reveal inadequacies in the bank’s value-at-risk model, as will be 
explained shortly. 

The first term in equation (15.3) covers risks relating to movements in broad market 
variables such as interest rates, exchange rates, stock indices, and commodity prices. The 
second term, SRC, covers risks related to specific companies such as those concerned 
with movements in a company’s stock price or changes in a company’s credit spread. 

Consider the first term and assume m, = 3. In most circumstances, the most recently 
calculated VaR, VaR,_), is less than three times the average VaR over the past 60 days. 
This gives the capital requirement for movements in broad market variables as simply 


3 x VaR ve 


The most popular method for calculating VaR is historical simulation, described in Chap- 
ter 13. As explained in Chapter 12, banks almost invariably calculate a one-day 99% VaR 
in the first instance. When formulating the 1996 amendment, regulators explicitly stated 
that the 10-day 99% VaR can be calculated as 1/10 times the one-day 99% VaR. This 
means that, when the capital requirement for a bank is calculated as m, times the average 


é Certain nontrading book positions that are used to hedge positions in the trading book can be 
included in the calculation of the market risk capital charge. 
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10-day 99% VaR, it is to all intents and purposes m, X v10 = 3.16m, times the average 
one-day 99% VaR. If m, = 3, it is 9.48 times the average one-day VaR. 

Consider next SRC. One security that gives rise to an SRC is a corporate bond. 
There are two components to the risk of this security: interest rate risk and credit risk of 
the corporation issuing the bond. The interest rate risk is captured by the first term in 
equation (15.3); the credit risk is captured by the SRC.’ The 1996 Amendment proposed 
standardized methods for assessing the SRC, but allowed banks to use internal models 
once regulatory approval for the models had been obtained. 

The internal model for SRC must involve calculating a 10-day 99% value at risk 
for specific risks. Regulators calculate capital by applying a multiplicative factor (similar 
to m,) to the value at risk. This multiplicative factor must be at least 4 and the resultant 
capital must be at least 50% of the capital given by the standardized approach. A method 
for calculating SRC from credit spreads will be given in Section 21.5. 

The total capital a bank was required to keep after the implementation of the 1996 
Amendment was the sum of (a) credit risk capital equal to 8% of the risk-weighted 
assets (RWA) and (b) market risk capital as explained in this section. For convenience, 
an RWA for market risk capital was defined as 12.5 multiplied by the amount given in 
equation (15.3). This means that the total capital required for credit and market risk is 
given by 


Total Capital = 0.08 X (credit risk RWA + market risk RWA) (15.4) 


A bank had more flexibility in the type of capital it used for market risk. It could use 
Tier 1 or Tier 2 capital. It could also use what is termed Tier 3 capital. This consists 
of short-term subordinated debt with an original maturity of at least two years that is 
unsecured and fully paid up. (Tier 3 capital was eliminated under Basel III.) 


15.6.1 Back-Testing 


The BIS Amendment requires the one-day 99% VaR that a bank calculates to be back- 
tested over the previous 250 days. As described in Section 12.10, this involves using the 
bank’s current procedure for estimating VaR for each of the most recent 250 days. If the 
actual loss that occurred on a day is greater than the VaR level calculated for the day, 
an “exception” is recorded. Calculations are carried out (a) including changes that were 
made to the portfolio on the day being considered and (b) assuming that no changes 
were made to the portfolio on the day being considered. (Regulators like to see both 
calculations.) 

If the number of exceptions during the previous 250 days is less than 5, m, is normally 
set equal to 3. Ifthe number of exceptions is 5,6, 7,8, and 9, the value of the m, is set equal 


7 As mentioned earlier, the 1988 credit risk capital charge did not apply to debt securities in the trading 
book under the 1996 Amendment. 
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to 3.4,3.5,3.65, 3.75, and 3.85, respectively. The bank supervisor has some discretion as to 
whether the higher multipliers are used. They will normally be applied when the reason 
for the exceptions is identified as a deficiency in the VaR model being used. If changes 
in the bank’s positions during the day result in exceptions, the higher multiplier should 
be considered, but does not have to be used. When the only reason that is identified is 
bad luck, no guidance is provided for the supervisor. In circumstances where the number 
of exceptions is 10 or more, the Basel Amendment requires the multiplier to be set 
at 4. Problem 15.18 considers these guidelines in the context of the statistical tests we 
discussed in Section 12.10. 


15.7 Basel II 


The 1988 Basel Accord improved the way capital requirements were determined, but it 
does have significant weaknesses. Under the Accord, all loans by a bank to a corporation 
have a risk weight of 100% and require the same amount of capital. A loan to a corpo- 
ration with a AAA credit rating is treated in the same way as one to a corporation with 
a B credit ratings Also, in Basel I there was no model of default correlation. 

In June 1999, the Basel Committee proposed new rules that have become known as 
Basel II. These were revised in January 2001 and April 2003. A number of Quantitative 
Impact Studies (QISs) were carried out prior to the implementation of the new rules to 
test them by calculating the amount of capital that would be required if the rules had 
been in place? A final set of rules agreed to by all members of the Basel Committee was 
published in June 2004. This was updated in November 2005. Implementation of the 
rules began in 2007 after a further QIS. 

The Basel II capital requirements applied to “internationally active” banks. In the 
United States, there are many small regional banks and the U.S. regulatory authorities 
decided that Basel II would not apply to them. (These banks are regulated under what 
is termed Basel IA, which is similar to Basel I.) In Europe, all banks, large or small, were 
regulated under Basel II. Furthermore, the European Union required the Basel II rules 
to be applied to securities companies as well as banks. 

The Basel II is based on three “pillars”: 


1. Minimum Capital Requirements 
2. Supervisory Review 
3. Market Discipline 


In Pillar 1, the minimum capital requirement for credit risk in the banking book 
is calculated in a new way that reflects the credit risk of counterparties. The capital 


8 Credit ratings are discussed in Section 1.7. 

? One point to note about the QISs is that they do not take account of changes banks may choose 
to make to their portfolios to minimize their capital requirements once the new rules have been 
implemented. 


360 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


requirement for market risk remains unchanged from the 1996 Amendment and there 
is a new capital charge for operational risk. The general requirement in Basel I that 
banks hold a total capital equal to 8% of risk-weighted assets (RWA) remains unchanged. 
When the capital requirement for a particular risk is calculated directly rather than in a 
way involving RWAs, it is multiplied by 12.5 to convert it into an RWA-equivalent. As 
a result it is always the case that 


Total Capital = 0.08 x (credit risk RWA + market risk RWA 
+ operational risk RWA) (15.5) 


Pillar 2 is concerned with the supervisory review process. It covers both quantitative and 
qualitative aspects of the ways risk is managed within a bank. Supervisors are required to 
ensure that a bank has a process in place for ensuring that capital levels are maintained. 
Banks are expected to keep more than the minimum regulatory capital to allow for 
fluctuations in capital requirements and possible difficulties in raising capital at short 
notice. Regulators in different countries are allowed some discretion in how rules are 
applied (so that they can take account of local conditions), but overall consistency in the 
application of the rules is required. Pillar 2 places more emphasis on early intervention 
when problems arise. Supervisors are required to do far more than just ensuring that the 
minimum capital required under Basel II is held. Part of their role is to encourage banks 
to develop and use better risk management techniques and to evaluate those techniques. 
They should evaluate risks that are not covered by Pillar 1 (e.g., concentration risks) and 
enter into an active dialogue with banks when deficiencies are identified. 

The third pillar, market discipline, requires banks to disclose more information about 
the way they allocate capital and the risks they take. The idea here is that banks will be 
subjected to added pressure to make sound risk management decisions if shareholders 
and potential shareholders have more information about those decisions. 


15.8 Credit Risk Capital Under Basel II 


For credit risk, Basel II specified three approaches: 


1. The Standardized Approach 
2. The Foundation Internal Ratings Based (IRB) Approach 
3. The Advanced IRB Approach 


However, the United States (which, as mentioned earlier, chose to apply Basel II only 
to large banks) decided that only the IRB approach can be used. 


15.8.1 The Standardized Approach 


The standardized approach is used by banks that are not sufficiently sophisticated (in the 
eyes of the regulators) to use the internal ratings approaches. The standardized approach 
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Table 15.4 Risk Weights as a Percent of Principal for Exposures to Countries, Banks, and 
Corporations Under Basel II’s Standardized Approach 


AAA A+ BBB+ BB+ B+ Below 
to AA- to A- to BBB- to BB- to B- B- Unrated 
Country“ 0 20 50 100 100 150 100 
Banks** 20 50 50 100 100 150 50 
Corporations 20 50 100 100 150 150 100 


*Includes exposures to the country’s central bank. 


** National supervisors have options as outlined in the text. 


is similar to Basel I except for the calculation of risk weights.!° Some of the new rules 
here are summarized in Table 15.4. Comparing Table 15.4 with Table 15.1, we see that 
the OECD status of a bank or a country is no longer considered important under Basel 
II. The risk weight for a country (sovereign) exposure ranges from 0% to 150% and 
the risk weight for an exposure to another bank or a corporation ranges from 20% to 
150%. In Table 15.1, OECD banks were implicitly assumed to be lesser credit risks 
than corporations. An OECD bank attracted a risk weight of 20% while a corporation 
attracted a risk weight of 100%. Table 15.4 treats banks and corporations much more 
equitably. An interesting observation from Table 15.4 for a country, corporation, or bank 
that wants to borrow money is that it may be better to have no credit rating at all than a 
very poor credit rating. (Usually a company gets a credit rating when it issues a publicly 
traded bond.) Supervisors are allowed to apply lower risk weights (20% rather than 50%, 
50% rather than 100%, and 100% rather than 150%) when exposures are to the country 
in which the bank is incorporated or to that country’s central bank. 

The risk weight for mortgages in the Basel II standardized approach was 35% and that 
for other retail loans was 75%. For claims on banks, the rules are somewhat complicated. 
Instead of using the risk weights in Table 15.4, national supervisors can choose to base 
capital requirements on the rating of the country in which the bank is incorporated. The 
risk weight assigned to the bank will be 20% if the country of incorporation has a rating 
between AAA and AA-, 50% if it is between A+ and A-, 100% if it is between BBB+ 
and B-, 150% if it is below B-, and 100% if it is unrated. Another complication is that, if 
national supervisors elect to use the rules in Table 15.4, they can choose to treat claims 
with a maturity less than three months more favorably so that the risk weights are 20% 
if the rating is between AAA+ and BBB-, 50% if it is between BB+ and B-, 150% if it 
is below B-, and 20% if it is unrated. 

In December 2017, the Basel Committee created a revised more granular standard- 
ized approach for determining risk weights for credit exposures. It also limited the use 
of the Advanced IRB approaches in some situations. As explained in Section 16.4, the 
revised standardized approach will provide a floor for the determination of total capital 
requirements. 


10 Ratios calculated using the new weights are sometimes referred to as McDonough ratios after 
William McDonough, the head of the Basel Committee. 
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Example 15.5 


Suppose that the assets of a bank consist of $100 million of loans to corporations rated A, 
$10 million of government bonds rated AAA, and $50 million of residential mortgages. 
Under the Basel II standardized approach, the total of the risk-weighted assets is 


0.5 x 100 + 0.0 x 10 + 0.35 x 50 = 67.5 


or $67.5 million. This compares with $125 million under Basel I. (See Example 15.1.) 


15.8.2 Adjustments for Collateral 


There are two ways banks can adjust risk weights for collateral. The first is termed the 
simple approach and is similar to an approach used in Basel I. The second is termed the 
comprehensive approach. Banks have a choice as to which approach is used in the banking 
book, but must use the comprehensive approach to calculate capital for counterparty 
credit risk in the trading book. 

Under the simple approach, the risk weight of the counterparty is replaced by the 
risk weight of the collateral for the part of the exposure covered by the collateral. (The 
exposure is calculated after netting.) For any exposure not covered by the collateral, the 
risk weight of the counterparty is used. The minimum level for the risk weight applied 
to the collateral is 20%.!! A requirement is that the collateral must be revalued at least 
every six months and must be pledged for at least the life of the exposure. 

Under the comprehensive approach, banks adjust the size of their exposure upward to 
allow for possible increases in the exposure and adjust the value of the collateral down- 
ward to allow for possible decreases in the value of the collateral.!? (The adjustments 
depend on the volatility of the exposure and the collateral.) A new exposure equal to 
the excess of the adjusted exposure over the adjusted value of the collateral is calculated 
and the counterparty’s risk weight is applied to this exposure. The adjustments applied to 
the exposure and the collateral can be calculated using rules specified in Basel II or, with 
regulatory approval, using a bank’s internal models. Where netting arrangements apply, 
exposures and collateral are separately netted and the adjustments made are weighted 
averages. 


Example 15.6 


Suppose that an $80 million exposure to a particular counterparty is secured by collateral 
worth $70 million. The collateral consists of bonds issued by an A-rated company. The 


11 An exception is when the collateral consists of cash or government securities with the currency of 
the collateral being the same as the currency of the exposure. 

12 An adjustment to the exposure is not likely to be necessary on a loan, but is likely to be necessary 
on an over-the-counter derivative. The adjustment is in addition to the add-on factor. 
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counterparty has a rating of B+. The risk weight for the counterparty is 150% and the 
risk weight for the collateral is 50%. The risk-weighted assets applicable to the exposure 
using the simple approach is 


0.5 X 70 + 1.50 x 10 = 50 


or $50 million. 

Consider next the comprehensive approach. Assume that the adjustment to exposure 
to allow for possible future increases in the exposure is +10% and the adjustment to the 
collateral to allow for possible future decreases in its value is —15%. The new exposure is 


1.1 x 80 — 0.85 x 70 = 28.5 


or $28.5 million, and a risk weight of 150% is applied to this exposure to give risk- 
adjusted assets equal to $42.75 million. 


15.8.3 The IRB Approach 


The model underlying the IRB approach is shown in Figure 15.1. Regulators base the 
capital requirement on the value at risk calculated using a one-year time horizon and a 
99.9% confidence level. They recognize that expected losses are usually covered by the 
way a financial institution prices its products. (For example, the interest charged by a bank 
on a loan is designed to recover expected loan losses.) The capital required is therefore 
the value at risk minus the expected loss. 

The value at risk is calculated using the one-factor Gaussian copula model of time 
to default that we discussed in Section 11.6. Assume that a bank has a very large number 


Expected one- One-year 
year loss 99% VaR 


Capital 


One-year loss 


Figure 15.1 The Loss Probability Density Function and the Capital Required by a Financial 
Institution 
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of obligors and the ith obligor has a one-year probability of default equal to PD;. The 
copula correlation between each pair of obligors is p.13 As in Section 11.6, we define 


N7'(PD,) + /pN' (0.999) 
WCDR, = N | ———————_ (15.6) 


VET 


where WCDR,, denotes the “worst-case default rate” defined so that the bank is 99.9% 
certain it will not be exceeded next year for the ith counterparty. Gordy’s (2003) research 
shows that for a large portfolio of instruments (loans, loan commitments, derivatives, 
and so on) that have the same p, in a one-factor model the one-year 99.9% VaR is 
approximately!* 


>! EAD; x LGD; x WCDR, 


where EAD, is the exposure at default of the ith counterparty and LGD, is the loss given 
default for the ith counterparty. 

The variable EAD, is the dollar amount that is expected to be owed by the ith coun- 
terparty at the time of default during the next year. The variable LGD; is the proportion 
of EAD, that is expected to be lost in the event of default. For example, if a bank expects 
to recover 30% of the amount owed in the event of default, LGD; = 0.7. 

The expected loss from defaults is 


Ý EAD; x LGD; x PD, 


The capital required in Figure 15.1 is the excess of the 99.9% worst-case loss over the 
expected loss. It is therefore 


Ý, EAD; x LGD, x (WCDR, — PD) (15.7) 


We now drop the subscripts and define for a counterparty: 


PD: The probability that the counterparty will default within one year (expressed as a 
decimal) 

EAD: The exposure at default (in dollars) 

LGD: The loss given default or the proportion of the exposure that is lost if there is a 
default (expressed as a decimal) 


Table 15.5 shows how WCDR depends on PD and p in the Gaussian copula model. 
When the correlation p is zero, WCDR = PD because in that case there is no default 


'3Note that the Basel Committee publications use R, not p, to denote the copula correlation. 
14See M. B. Gordy, “A Risk-Factor Model Foundation for Ratings-Based Bank Capital Ratios,” Journal 
of Financial Intermediation 12 (2003): 199-232. 
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Table 15.5 Dependence of One-Year 99.9% WCDR on PD and p 


PD = 0.1% PD = 0.5% PD = 1% PD = 1.5% PD = 2.0% 
p = 0.0 0.1% 0.5% 1.0% 1.5% 2.0% 
p = 0.2 2.8% 9.1% 14.6% 18.9% 22.6% 
p = 0.4 7.1% 21.1% 31.6% 39.0% 44.9% 
p = 0.6 13.5% 38.7% 54.2% 63.8% 70.5% 
p= 0.8 23.3% 66.3% 83.6% 90.8% 94.4% 


correlation and the percentage of loans defaulting can be expected to be the same in all 
years. As p increases, WCDR increases. 


15.8.4 Exposure at Default 


In the Foundation IRB approach, the calculation of EAD for derivatives is usually based 
on the CEM approach of Basel I.!° In the Advanced IRB approach, banks can use their 
internal models to calculate EAD. 

The first stage when internal models are used is to calculate expected exposure (EE) 
to each counterparty at a number of future times. (This generally involves Monte Carlo 
simulation and methods discussed in Chapter 20.) The Effective Expected Exposure 
(Effective EE) at a future time, t, is the maximum EE between time zero an time t. EAD 
is set equal to 1.4 times the Effective Expected Positive Exposure, which is the average 
of the Effective EEs over the first year.'° 


15.8.5 Corporate, Sovereign, and Bank Exposures 


In the case of corporate, sovereign, and bank exposures, Basel II assumes a relationship 
between the correlation parameter, p, and the probability of default, PD, based on empir- 
ical research." The formula is 

1 — exp(—50 x PD) 1 — exp(—50 x PD) 


p = 0.12——— + 0.24 ]1 - 
1 — exp(—50) 1 — exp(—50) 


Because exp(—50) is a very small number, this formula is to all intents and purposes 


p = 0.12 (1 + oP) (15.8) 


As PD increases, p decreases. The reason usually given for this inverse relationship is as 
follows. As a company becomes less creditworthy, its PD increases and its probability of 
default becomes more idiosyncratic and less affected by overall market conditions. 


15 As will be discussed in Section 16.4, this was revised under a regulatory change known as SA-CCR. 
'6See Basel Committee on Banking Supervision, “The Application of Basel II to Trading Activities 
and the Treatment of Double Default Effects,” July 2005. 

'7See J. Lopez, “The Empirical Relationship between Average Asset Correlation, Firm Probability of 
Default, and Asset Size,” Journal of Financial Intermediation 13, no. 2 (2004): 265-283. 
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Table 15.6 Relationship between One-Year 99.9% WCDR and PD for Corporate, Sovereign, 
and Bank Exposures 


PD 0.1% 0.5% 1% 1.5% 2.0% 
WCDR 3.4% 9.8% 14.0% 16.9% 19.0% 


Combining equation (15.8) with equation (15.6), we obtain the relationship between 
WCDR and PD in Table 15.6. WCDR is, as one would expect, an increasing function of 
PD. However, it does not increase as fast as it would if p were assumed to be independent 
of PD. 


The formula for the capital required for the counterparty is 
EAD x LGD x (WCDR — PD) x MA (15.9) 


The meaning of the first three terms in this expression should be clear from our earlier 
discussion leading to equation (15.7). The variable MA is the maturity adjustment and 
is defined as 


1+(M—2.5) xb 
1-1.5xb 


MA= (15.10) 


where 
b = [0.11852 — 0.05478 x In (PD)]? 


and M is the maturity of the exposure. The maturity adjustment is designed to allow 
for the fact that, if an instrument lasts longer than one year, there is a one-year credit 
exposure arising from a possible decline in the creditworthiness of the counterparty as 
well as from a possible default by the counterparty. (Note that, when M = 1, MA is 1.0 
and has no effect.) As mentioned in Section 15.7 (see Equation (15.5)), the risk-weighted 
assets (RWA) are calculated as 12.5 times the capital required 


RWA = 12.5 x EAD x LGD x (WCDR — PD) x MA 


so that the capital is 8% of RWA, 4% of which must be Tier 1. 

Under the Foundation IRB approach, banks supply PD while LGD, EAD, and M are 
supervisory values set by the Basel Committee. PD is subject to a floor of 0.03% for bank 
and corporate exposures. LGD is set at 45% for senior claims and 75% for subordinated 
claims. When there is eligible collateral, in order to correspond to the comprehensive 
approach that we described earlier, LGD is reduced by the ratio of the adjusted value of 
the collateral to the adjusted value of the exposure, both calculated using the comprehen- 
sive approach. For derivatives, the EAD is calculated using the CEM “current exposure 
plus add-on” approach of Basel I and includes the impact of netting. M is set at 2.5 in 
most circumstances. 
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Under the Advanced IRB approach, banks supply their own estimates of the PD, 
LGD, EAD, and M for corporate, sovereign, and bank exposures. The PD can be reduced 
by credit mitigants such as credit triggers. (As in the case of the Foundation IRB approach, 
it is subject to a floor of 0.03% for bank and corporate exposures.) The two main factors 
influencing the LGD are the seniority of the debt and the collateral. In calculating EAD, 
banks can with regulatory approval use their own models. As mentioned, in the case of 
derivatives, the model is likely to involve Monte Carlo simulation to determine how 
expected exposure (after netting and collateral) will vary over the next year. 

The capital given by equation (15.9) is intended to be sufficient to cover unexpected 
losses over a one-year period that we are 99.9% certain will not be exceeded. (As discussed 
earlier, the expected losses should be covered by a bank in the way it prices its products.) 
The WCDR is the default rate that (theoretically) happens once every thousand years. 
The Basel Committee reserved the right to apply a scaling factor (less than or greater 
than 1.0) to the result of the calculations in equation (15.9) if it finds that the aggregate 
capital requirements are too high or low. A typical scaling factor is 1.06. 


Example 15.7 


Suppose that the assets of a bank consist of $100 million of loans to A-rated corporations. 
The PD for the corporations is estimated as 0.1% and the LGD is 60%. The average 
maturity is 2.5 years for the corporate loans. This means that 


b = [0.11852 — 0.05478 x In(0.001)]? = 0.247 


so that 


1 


MA = —————— = 
1—1.5 x 0.247 


1.59 


From Table 15.6, the WCDR is 3.4%. Under the Basel II IRB approach, the risk- 
weighted assets for the corporate loans are 


12.5 x 100 x 0.6 X (0.034 — 0.001) x 1.59 = 39.3 


or $39.3 million. This compares with $100 million under Basel I and $50 million under 
the standardized approach of Basel II. (See Examples 15.1 and 15.5 where a $100 million 
corporate loan is part of the portfolio.) 


15.8.6 Retail Exposures 


The model underlying the calculation of capital for retail exposures is similar to that 
underlying the calculation of corporate, sovereign, and banking exposures. However, the 
Foundation IRB and Advanced IRB approaches are merged and all banks using the 
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Table 15.7 Relationship between One-Year 99.9% WCDR and PD for Retail Exposures 


PD 0.1% 0.5% 1.0% 1.5% 2.0% 
WCDR 2.1% 6.3% 9.1% 11.0% 12.3% 


IRB approach provide their own estimates of PD, EAD, and LGD. There is no maturity 
adjustment, MA. The capital requirement is therefore 


EAD x LGD x (WCDR — PD) 
and the risk-weighted assets are 
12.5 X EAD x LGD x (WCDR — PD) 


WCDR is calculated as in equation (15.6). For residential mortgages, p is set equal to 
0.15 in this equation.!® For qualifying revolving exposures, p is set equal to 0.04. For all 
other retail exposures, a relationship between p and PD is specified for the calculation of 
WCDR. This is 


1 — exp(—35 X PD) 1 — exp(—35 X PD) 
p = 0.03 —__—____—. + 0.16 |1 — —— 
1 — exp(—35) 1 — exp(—35) 


Because exp(—35) is a very small number, this formula is to all intents and purposes 
p = 0.03 + 0.1347 *PP (15.11) 


Comparing equation (15.11) with equation (15.8), we see that correlations are assumed 
to be much lower for retail exposures than for corporate exposures. Table 15.7 is the 
table corresponding to Table 15.6 for retail exposures. 


Example 15.8 


Suppose that the assets of a bank consist of $50 million of residential mortgages where 
the PD is 0.005 and the LGD is 20%. In this case, p = 0.15 and 


N-!(0.005) + /0.15N7!(0.999) 
WCDR = N | ———————— "| = 0.067 


y1-0.15 


The risk-weighted assets are 


12.5 X 50 x 0.2 X (0.067 — 0.005) = 7.8 


'8Tn the light of experience during the crisis, this is probably too low. 


Basel I, Basel I, and Solvency II 369 


or $7.8 million. This compares with $25 million under Basel I and $17.5 million under 
the Standardized Approach of Basel II. (See Examples 15.1 and 15.5 where $50 million 
of residential mortgages is part of the portfolio.) 


15.8.7 Guarantees and Credit Derivatives 


The approach traditionally taken by the Basel Committee for handling guarantees and 
credit derivatives such as credit default swaps is the credit substitution approach. Suppose 
that a AA-rated company guarantees a loan to a BBB-rated company. For the purposes 
of calculating capital, the credit rating of the guarantor is substituted for the credit rating 
of the borrower so that capital is calculated as though the loan had been made to the 
AA-rated company. This overstates the credit risk because, for the lender to lose money, 
both the guarantor and the borrower must default (with the guarantor defaulting before 
the borrower).'? The Basel Committee has addressed this issue. In July 2005, it pub- 
lished a document concerned with the treatment of double defaults under Basel II?° 
As an alternative to using the credit substitution approach, the capital requirement can 
be calculated as the capital that would be required without the guarantee multiplied by 
0.15 + 160 x PD, where PD, is the one-year probability of default of the guarantor. 


15.9 Operational Risk Capital Under Basel II 


In addition to changing the way banks calculate credit risk capital, Basel I] requires banks 
to keep capital for operational risk. This is the risk of losses from situations where the 
bank’s procedures fail to work as they are supposed to or where there is an adverse 
external event such as a fire in a key facility. 

There are three approaches to calculating capital for operational risk in Basel II: 


1. The Basic Indicator Approach 
2. The Standardized Approach 
3. The Advanced Measurement Approach 


Which of these is used depends on the sophistication of the bank. The simplest 
approach is the Basic Indicator Approach. This sets the operational risk capital equal to 
the bank’s average annual gross income over the last three years multiplied by 0.152! The 
Standardized Approach is similar to the basic indicator approach except that a different 


Credit default swaps, which we discuss in Chapter 19, provide a type of insurance against default 
and are handled similarly to guarantees for regulatory purposes. 

0See Bank for International Settlements, “The Application of Basel II to Trading Activities and the 
Treatment of Double Defaults,’ July 2005, available on www.bis.org. 

21 Gross income defined as net interest income plus non-interest income. Net interest income is the 
excess of income earned on loans over interest paid on deposits and other instruments that are used 
to fund the loans. Years where gross income is negative are not included in the calculations. 
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factor is applied to the gross income from different business lines. In the Advanced Mea- 
surement Approach, the bank uses its own internal models to calculate the operational 
risk loss that it is 99.9% certain will not be exceeded in one year. Similarly to the way 
credit risk capital is calculated in the IRB approach, operational risk capital is set equal to 
this loss minus the expected loss. One advantage of the advanced measurement approach 
is that it allows banks to recognize the risk-mitigating impact of insurance contracts 
subject to certain conditions. The calculation of operational risk is discussed further in 
Chapter 23. 

As will be explained in Chapter 23, regulators do not feel the Advanced Measurement 
Approach has been successful and in 2016 proposed replacing it with a new, simpler 
approach. 


15.10 Pillar 2: Supervisory Review 


Pillar 2 of Basel II is concerned with the supervisory review process. Four key principles 
of supervisory review are specified: 


1. Banks should have a process for assessing their overall capital adequacy in relation to 
their risk profile and a strategy for maintaining their capital levels. 

2. Supervisors should review and evaluate banks’ internal capital adequacy assessments 
and strategies, as well as their ability to monitor and ensure compliance with regula- 
tory capital ratios. Supervisors should take appropriate supervisory action if they are 
not satisfied with the result of this process. 

3. Supervisors should expect banks to operate above the minimum regulatory capital 
and should have the ability to require banks to hold capital in excess of this minimum. 

4. Supervisors should seek to intervene at an early stage to prevent capital from falling 
below the minimum levels required to support the risk characteristics of a particular 
bank and should require rapid remedial action if capital is not maintained or restored. 


The Basel Committee suggests that regulators pay particular attention to interest rate 
risk in the banking book, credit risk, and operational risk. Key issues in credit risk are 
stress tests used, default definitions used, credit risk concentration, and the risks associated 
with the use of collateral, guarantees, and credit derivatives. 

The Basel Committee also stresses that there should be transparency and account- 
ability in the procedures used by bank supervisors. This is particularly important when a 
supervisor exercises discretion in the procedures used or sets capital requirements above 
the minimum specified in Basel II. 


15.11 Pillar 3: Market Discipline 


Pillar 3 of Basel II is concerned with increasing the disclosure by a bank of its risk 
assessment procedures and capital adequacy. The extent to which regulators can force 
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banks to increase their disclosure varies from jurisdiction to jurisdiction. However, banks 
are unlikely to ignore directives on this from their supervisors, given the potential of 
supervisors to make their life difficult. Also, in some instances, banks have to increase 
their disclosure in order to be allowed to use particular methodologies for calculating 
capital. 

Regulatory disclosures are likely to be different in form from accounting disclosures 
and need not be made in annual reports. It is largely left to the bank to choose disclosures 
that are material and relevant. Among the items that banks should disclose are: 


1. The entities in the banking group to which Basel II is applied and adjustments made 
for entities to which it is not applied 

2. The terms and conditions of the main features of all capital instruments 

3. A list of the instruments constituting Tier 1 capital and the amount of capital pro- 
vided by each item 

4. The total amount of Tier 2 capital 

5. Capital requirements for credit, market, and operational risk 

6. Other general information on the risks to which a bank is exposed and the assessment 
methods used by the bank for different categories of risk 

7. The structure of the risk management function and how it operates 


15.12 Solvency I 


As discussed in Section 3.11, there are no international standards for the regulation of 
insurance companies. In the United States, insurance companies are regulated at the state 
level with some input from the Federal Insurance Office and the National Association 
of Insurance Commissioners. In Europe, the regulation of insurance companies is han- 
dled by the European Union. The long-standing regulatory framework in the European 
Union, known as Solvency I, was replaced by Solvency II in 2016. Whereas Solvency I 
calculates capital only for underwriting risks, Solvency II considers investment risks and 
operational risks as well. 

Solvency II has many similarities to Basel II. There are three pillars. Pillar 1 is con- 
cerned with the calculation of capital requirements and the types of capital that are 
eligible. Pillar 2 is concerned with the supervisory review process. Pillar 3 is concerned 
with the disclosure of risk management information to the market. The three pillars are 
therefore analogous to the three pillars of Basel II. 

Pillar 1 of Solvency H specifies a minimum capital requirement (MCR) and a solvency 
capital requirement (SCR). If its capital falls below the SCR level, an insurance company 
should, at minimum, deliver to the supervisor a plan to restore capital to above the SCR 
level. The supervisor might require the insurance company to take particular measures 
to correct the situation. The MCR 1s regarded as an absolute minimum level of capital. 
If capital drops below the MCR level, supervisors may assume control of the insur- 
ance company and prevent it from taking new business. It might force the insurance 
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company into liquidation, transferring its policies to another company. The MCR is 
typically between 25% and 45% of the SCR. 

There are two ways to calculate the SCR: the standardized approach and the inter- 
nal models approach. The internal models approach involves a VaR calculation with a 
one-year time horizon and a 99.5% confidence limit. (The confidence level is therefore 
less than the 99.9% confidence level used in Pillar 1 of Basel H.) Longer time horizons 
with lower confidence levels are also allowed when the protection provided is consid- 
ered equivalent. The SCR involves a capital charge for investment risk, underwriting 
risk, and operational risk. Investment risk is subdivided into market risk and credit risk. 
Underwriting risk is subdivided into risk arising from life insurance, non-life insurance 
(i.e., property and casualty), and health insurance. 

Capital should be adequate to deal with large adverse events. Examples of the events 
considered in quantitative impact studies are: 


. A 32% decrease in global stock markets 

. A 20% decrease in real estate prices 

. A 20% change in foreign exchange rates 

. Specified catastrophic risk scenarios affecting property and casualty payouts 

. Health care costs increasing by a factor times the historical standard deviation of costs 
. A 10% increase in mortality rates 

. A 25% decrease in mortality rates 


CNA Uk WN 


. A 10% increase in expenses 


The internal models are required to satisfy three tests. The first is a statistical quality 
test. This is a test of the soundness of the data and methodology used in calculating VaR. 
The second is a calibration test. This is a test of whether risks have been assessed in 
accordance with a common SCR target criterion. The third is a use test. This is a test of 
whether the model is genuinely relevant to and used by risk managers. 

There are three types of capital in Solvency II. Tier 1 capital consists of equity capital, 
retained earnings, and other equivalent funding sources. Tier 2 capital consists of liabil- 
ities that are subordinated to policyholders and satisfy certain criteria concerning their 
availability in wind-down scenarios. Tier 3 capital consists of liabilities that are subordi- 
nated to policyholders and do not satisfy these criteria. Similarly to Basel II, the amount 
of capital that must be Tier 1, Tier 1 plus Tier 2, and Tier 1 plus Tier 2 plus Tier 3 is 
specified. 


Summary 


This chapter has provided an overview of capital requirements for banks throughout the 
world. The way in which regulators calculate the minimum capital a bank is required 
to hold has changed dramatically since the 1980s. Prior to 1988, regulators determined 
capital requirements by specifying minimum ratios for capital to assets or maximum ratios 
for assets to capital. In the late 1980s, both bank supervisors and the banks themselves 
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agreed that changes were necessary. Derivatives trading was increasing fast. Also, banks 
were competing globally and it was considered important to create a level playing field 
by making regulations uniform throughout the world. 

The 1988 Basel Accord assigned capital for credit risk for both on- and off-balance- 
sheet exposures. This involved calculating a risk-weighted asset for each item. The risk- 
weighted assets for an on-balance-sheet loan were calculated by multiplying the princi- 
pal by a risk weight for the counterparty. In the case of derivatives such as swaps, banks 
were first required to calculate credit equivalent amounts. The risk-weighted assets were 
obtained by multiplying the credit equivalent amount by a risk weight for the counter- 
party. Banks were required to keep capital equal to 8% of the total risk-weighted assets. 
In 1995, the capital requirements for credit risk were modified to incorporate netting. 
As a result of an amendment introduced in 1996, the Accord was changed to include 
a capital charge for market risk. Sophisticated banks could base the capital charge on a 
value-at-risk calculation. 

Basel II was proposed in 1999 and implemented by many banks in about 2007. It led 
to no immediate change to the capital requirement for market risk. Credit risk capital 
was calculated in a more sophisticated way than previously to reflect either (a) the credit 
ratings of obligors or (b) estimates made by the bank in conjunction with a default 
correlation parameter specified by regulators. In addition, there was a capital charge for 
operational risk. 

Solvency II is a regulatory framework for insurance companies that was implemented 
by the European Union in 2016. It prescribes minimum capital levels for investment risk, 
underwriting risk, and operational risk. The structure of Solvency II is broadly similar to 
Basel II. 
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Practice Questions and Problems (Answers at End of Book) 


15.1 “When a steel company goes bankrupt, other companies in the same industry 
benefit because they have one less competitor. But when a bank goes bankrupt 
other banks do not necessarily benefit.” Explain this statement. 

15.2 “The existence of deposit insurance makes it particularly important for there to 
be regulations on the amount of capital banks hold.’ Explain this statement. 
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15.3 


15.4 


15.5 


15.6 


15.7 


15.8 


15.9 


15.10 


15.11 


15.12 


15.13 


15.14 


15.15 


An interest rate swap involves the exchange of a fixed rate of interest for a floating 

rate of interest with both being applied to the same principal. The principals are 

not exchanged. What is the nature of the credit risk for a bank when it enters into 

a five-year interest rate swap with a notional principal of $100 million? Assume 

the swap is worth zero initially. 

In a currency swap, interest on a principal in one currency is exchanged for inter- 

est on a principal in another currency. The principals in the two currencies are 

exchanged at the end of the life of the swap. Why is the credit risk on a currency 

swap greater than that on an interest rate swap? 

A four-year interest rate swap currently has a negative value to a financial institu- 

tion. Is the financial institution exposed to credit risk on the transaction? Explain 

your answer. How would the capital requirement be calculated under Basel I? 

Estimate the capital required under Basel I for a bank that has the following trans- 

actions with a corporation. Assume no netting. 

(a) A nine-year interest rate swap with a notional principal of $250 million and a 
current market value of —$2 million. 

(b) A four-year interest rate swap with a notional principal of $100 million and a 
current value of $3.5 million. 

(c) A six-month derivative on a commodity with a principal of $50 million that 
is currently worth $1 million. 

What is the capital required in Problem 15.6 under Basel I assuming that the 1995 

netting amendment applies? 

All the derivatives transactions a bank has with a corporate client have a positive 

value to the bank. What is the value to the bank of netting provisions in its master 

agreement with the client? 

Explain why the final stage in the Basel II calculations for credit risk (IRB), market 

risk, and operational risk is to multiply by 12.5. 

What is the difference between the trading book and the banking book for a 

bank? A bank currently has a loan of $10 million to a corporate client. At the 

end of the life of the loan, the client would like to sell debt securities to the bank 

instead of borrowing. How does this potentially affect the nature of the bank’s 

regulatory capital calculations? 

Under Basel I, banks do not like lending to highly creditworthy companies and 

prefer to help them issue debt securities. Why is this? Do you think this changed 

as a result of Basel II? 

Banks sometimes use what is referred to as regulatory arbitrage to reduce their 

capital. What do you think this means? 

Equation (15.9) gives the formula for the capital required under Basel II. It involves 

four terms being multiplied together. Explain each of these terms. 

Explain the difference between the simple and the comprehensive approach for 

adjusting capital requirements for collateral. 

Explain the difference between the standardized approach, the IRB approach, and 

the Advanced IRB approach for calculating credit risk capital under Basel II. 
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15.16 


15.17 


15.18 


15.19 


15.20 


15.21 


15.22 


Explain the difference between the basic indicator approach, the standardized 
approach, and the advanced measurement approach for calculating operational 
risk capital under Basel II. 

Suppose that the assets of a bank consist of $200 million of retail loans (not mort- 
gages). The PD is 1% and the LGD is 70%. What are the risk-weighted assets under 
the Basel II IRB approach? What are the Tier 1 and Tier 2 capital requirements? 
Section 12.10 discusses how statistics can be used to accept or reject a VaR model. 
Section 15.6 discusses guidelines for bank supervisors in setting the VaR multiplier 
m,. It explains that, if the number of exceptions in 250 trials is five or more, then 
m, is increased. What is the chance of five or more exceptions if the VaR model 
is working well? 


Further Questions 


Why is there an add-on amount in Basel I for derivatives transactions? “Basel I 
could be improved if the add-on amount for a derivatives transaction depended 
on the value of the transaction.” How would you argue this viewpoint? 
Estimate the capital required under Basel I for a bank that has the following trans- 
actions with another bank. Assume no netting. 
(a) A two-year forward contract on a foreign currency, currently worth $2 million, 
to buy foreign currency worth $50 million 
(b) A long position in a six-month option on the S&P 500. The principal is $20 
million and the current value is $4 million. 
(c) A two-year swap involving oil. The principal is $30 million and the current 
value of the swap is -$5 million. 
What difference does it make if the netting amendment applies? 
A bank has the following transaction with a AA-rated corporation 
(a) A two-year interest rate swap with a principal of $100 million that is worth 
$3 million 
(b) A nine-month foreign exchange forward contract with a principal of $150 
million that is worth -$5 million 
(c) A long position in a six-month option on gold with a principal of $50 million 
that is worth $7 million 
What is the capital requirement under Basel I if there is no netting? What 
difference does it make if the netting amendment applies? What is the capital 
required under Basel II when the standardized approach is used? 
Suppose that the assets of a bank consist of $500 million of loans to BBB-rated 
corporations. The PD for the corporations is estimated as 0.3%. The average matu- 
rity is three years and the LGD is 60%. What is the total risk-weighted assets for 
credit risk under the Basel II IRB approach? How much Tier 1 and Tier 2 capital 
is required? How does this compare with the capital required under the Basel I 
standardized approach and under Basel I? 


Chapter 16 


Basel II.5, Basel III, and Other 
Post-Crisis Changes 


t was perhaps unfortunate for Basel II that its implementation date coincided, at least 
I approximately, with the start of the worst crisis that financial markets had experi- 

enced since the 1930s. Some commentators have blamed Basel II for the crisis. They 
point out that it was a move toward self-regulation where banks, when calculating regula- 
tory capital, had the freedom to use their own estimates of model inputs such as PD, LGD, 
and EAD. However, blaming Basel II may be unfair because, as explained in Chapter 6, 
the seeds of the crisis were sown well before Basel II was implemented! 

This chapter starts by discussing what has become known as Basel II.5. This is a 
collection of changes to the calculation of market risk capital that was put in place by 
the Basel Committee on Banking Supervision following the large losses experienced by 
banks during the crisis. The implementation date for Basel II.5 was December 31, 2011. 

The chapter then moves on to consider Basel HI, which was a major overhaul of bank 
regulations, published by the Basel Committee in December 2010. Basel HI includes a 
series of rules concerned with increasing the amount of equity capital that banks have 
to keep for the risks being taken and tightening the definition of capital. An important 
new feature of Basel III is the specification of liquidity requirements that must be met 
by banks. Basel III is being phased in over several years. Full implementation is expected 
to be complete by 2019. 


1 Also, the United States was behind other countries in implementing Basel II. If Basel II had been 
fully implemented by the start of the crisis, capital levels at U.S. banks would probably have been lower. 
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The chapter also discusses other regulations that have been introduced since the 2008 
crisis to complement the work of the Basel Committee. These include the Dodd—Frank 
Act, which was signed into law by President Barack Obama in the United States on July 
21, 2010, and rules introduced in the European Union and the United Kingdom. 


16.1 Basel II.5 


During the credit crisis, it was recognized that some changes were necessary to the calcu- 
lation of capital for market risk. These changes are referred to as Basel II.5 and, as already 
mentioned, the implementation date for them was December 31, 2011? There are three 
changes involving: 


1. The calculation of a stressed VaR; 
2. A new incremental risk charge; and 
3. A comprehensive risk measure for instruments dependent on credit correlation. 


The measures have the effect of greatly increasing the market risk capital that large banks 
are required to hold. 


16.1.1 Stressed VaR 


The 1996 Amendment to Basel I, where capital was first required for market risk, allowed 
banks to base capital on a 10-day 99% VaR measure. Most banks use historical simulation 
to calculate VaR. This is described in Chapter 13. The assumption underlying historical 
simulation was that the changes in market variables during the following day would be 
a random sample from their daily changes (percentage or actual) observed during the 
previous one to four years. The 2003-2006 period was one where the volatility of most 
market variables was low. As a result, the market risk VaRs calculated during this period 
for regulatory capital purposes were also low. Furthermore, the VaRs continued to be 
too low for a period of time after the onset of the crisis, because much of the data used 
to calculate them continued to come from a low-volatility period. 

This led the Basel Committee to introduce the stressed VaR measure. As explained in 
Section 13.1.3,stressed VaR is determined by basing calculations on how market variables 
moved during a 250-day (12-month) period of stressed market conditions, rather than on 
how they moved during the past one to four years. The historical simulation calculations 
to arrive at a stressed VaR measure assume that the changes in market variables during 
the next day are a random sample from their daily changes observed during the 250-day 
period of stressed market conditions. 

Basel II.5 requires banks to calculate two VaRs. One is the usual VaR (based on 
the previous one to four years of market movements). The other is the stressed VaR 


? See Basel Committee on Banking Supervision, “Revisions to the Basel II Market Risk Framework,” 
February 2011. 
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(calculated from a stressed period of 250 days). The two VaR measures are combined to 
calculate a total capital charge. The formula for the total capital charge is 


max(VaR,_,, m, X VaR.) + max(sVaR,_,, m, X sVaR 


xe) 


ag) 


where VaR,_,; and sVaR,_, are the VaR and stressed VaR (with a 10-day time horizon 
and a 99% confidence level) calculated on the previous day. The variables VaR wg and 
sVaR vg are the average of VaR and stressed VaR (again with a 10-day time horizon and 
a 99% confidence level) calculated over the previous 60 days. The parameters m, and m, 
are multiplicative factors that are determined by bank supervisors and are at minimum 
equal to three. As explained in Section 15.6, the capital requirement prior to Basel II.5 
was 
max(VaR,_;, m, X VaR wg) 

Because stressed VaR is always at least as great as VaR, the formula shows that (assuming 
m, =m) the impact of this new rule is to at least double the capital requirement. In 
practice, a trebling of the capital requirement as a result of the change was not unusual. 

Originally it was considered that 2008 would constitute a good one-year period for 
the calculation of stressed VaR. Later it was realized that the one-year period chosen 
should reflect a bank’s portfolio. A bank is now required to search for a one-year period 
that would be particularly stressful for its current portfolio. The stressed period used by 
one bank is therefore not necessarily the same as that used by another bank. 


16.1.2 Incremental Risk Charge 


In 2005, the Basel Committee became concerned that exposures in the trading book were 
attracting less capital than similar exposures in the banking book. Consider a bond. If held 
in the trading book, the capital would be calculated by applying a multiplier to the 10- 
day 99% VaR, as discussed in Section 15.6. If held in the banking book (and treated like a 
loan), capital for the bond would be calculated using VaR with a one-year time horizon 
and a 99.9% confidence level, as discussed in Section 15.8. The trading-book calculation 
usually gave rise to a much lower capital charge than the banking-book calculation. As 
a result, banks tended whenever possible to hold credit-dependent instruments in the 
trading book? 

Regulators proposed an incremental default risk charge (IDRC) in 2005 that would 
be calculated with a 99.9% confidence level and a one-year time horizon for instruments 
in the trading book that were sensitive to default risk. The effect of this would have been 


31f a bank created ABSs from loans in the banking book, as described in Chapter 6, and then bought 
all the resultant tranches for its trading book, regulatory capital requirements would be lowered even 
though the bank’s risks would be unchanged. This was one reason why banks wanted to securitize 
loans in the banking book. 
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that the capital requirement for these instruments equaled the maximum of the capital 
using trading book calculations and the capital using banking book calculations. In 2008, 
the Basel Committee recognized that most of the losses in the credit market turmoil of 
2007 and 2008 were from changes in credit ratings, widening of credit spreads, and loss 
of liquidity, rather than solely as a result of defaults. It therefore amended its previous 
proposals to reflect this and the IDRC became the incremental risk charge (IRC). 

The IRC requires banks to calculate a one-year 99.9% VaR for losses from credit- 
sensitive products in the trading book taking both credit rating changes and defaults 
into account. Like the IDRC, the aim was to set capital equal to the maximum of that 
obtained using trading book calculations and that obtained using banking book calcula- 
tions. Because the instruments subject to the IRC are in the trading book, it is assumed 
that a bank will have the opportunity to rebalance its portfolio during the course of the 
year so that default risk is mitigated. Banks are therefore required to estimate a liquidity 
horizon for each instrument subject to the IRC. The liquidity horizon represents the 
time required to sell the position or to hedge all material risks in a stressed market. 

Suppose that the liquidity horizon for a bond with a credit rating of A is three 
months. For the purposes of the calculation of VaR over a one-year time horizon, the 
bank assumes that at the end of three months, if the bond’s rating has changed or if it has 
defaulted, it is replaced by an A-rated bond similar to that held at the beginning of the 
period. The same thing happens at the end of six months and at the end of nine months. 
This is known as the constant level of risk assumption. 

The impact of the constant level of risk assumption is that it is less likely that there will 
be a default. Instead, small losses are realized from ratings downgrades when rebalancing 
takes place. The assumption typically has the effect of reducing the one-year 99.9% VaR 3 
The minimum liquidity horizon for IRC is specified by the Basel Committee as three 
months. 

The IRC therefore provides a measure of the default and credit migration risks of 
credit products over a one-year horizon at a 99.9% confidence level, taking into account 
the liquidity horizons of individual positions or sets of positions. 


16.1.3 The Comprehensive Risk Measure 


The comprehensive risk measure (CRM) is designed to take account of risks in what 
is known as the correlation book. This is the portfolio of instruments such as asset- 
backed securities (ABSs) and collateralized debt obligations (CDOs) that are sensitive 
to the correlation between the default risks of different assets. These instruments were 
discussed in Chapter 6. Suppose a bank has a AAA-rated tranche of an ABS. In the 
normal market environment, there is very little risk of losses from the tranche. However, 


* Basel Committee on Banking Supervision, “Guidelines for Computing Capital for Incremental Risk 
in the Trading Book,” July 2009. 
5See C. Finger, “CreditMetrics and Constant Level of Risk,’ MSCI, 2010, for a discussion of the 
constant level of risk assumption. 
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Table 16.1 Standardized Capital Charge for Correlation-Dependent Instruments 


External Credit AAA to BBB+ to BB+ to Below BB— or 
Assessment AA— A+ to A— BBB- BB- Unrated 
Securitizations 1.6% 4% 8% 28% Deduction 
Resecuritizations 3.2% 8% 18% 52% Deduction 


in stressed market environments when correlations increase, the tranche is vulnerable—as 
became apparent during the 2007—2009 crisis. 

The CRM is a single capital charge replacing the incremental risk charge and the 
specific risk charge for instruments dependent on credit correlation. The Basel II.5 stan- 
dardized approach for calculating the CRM is summarized in Table 16.1. Given the 
experience of the securitization market during the crisis (see Chapter 6), it is not sur- 
prising that capital charges are higher for resecuritizations (e.g., ABS CDOs) than for 
securitizations (e.g., ABSs). A deduction means than the principal amount is subtracted 
from capital, which is equivalent to a 100% capital charge. 

Basel II.5 allows banks, with supervisory approval, to use their internal models to 
calculate the CRM. The models developed by banks have to be quite sophisticated to be 
approved by bank supervisors. For example, they must capture the cumulative impact of 
multiple defaults, credit spread risk, the volatility of implied correlations, the relationship 
between credit spreads and implied correlations, recovery rate volatility, the risk of hedge 
slippage, and potential hedge rebalancing costs. A routine and rigorous program of stress 
testing is also required. The capital charge calculated from an internal model is subject 
to a floor calculated from the capital given by the standardized approach. 


16.2 Basel III 


Following the 2007—2009 credit crisis, the Basel Committee realized that a major over- 
haul of Basel II was necessary. Basel II.5 increased capital requirements for market risk. 
The Basel Committee wanted to increase equity capital requirements further. In addi- 
tion, it considered that the definition of equity capital needed to be tightened and that 
regulations were needed to address liquidity risk. 

Basel III proposals were first published in December 2009. Following comments 
from banks, a quantitative impact study, and a number of international summits, the final 
version of the regulations was published in December 2010.° There are six parts to the 
regulations: 


1. Capital Definition and Requirements 
2. Capital Conservation Bufter 


é See Basel Committee on Banking Supervision, “Basel III: A Global Regulatory Framework for More 
Resilient Banks and Banking Systems,” June 2011, and Basel Committee on Banking Supervision, 
“Basel III: International Framework for Liquidity Risk Measurement Standards and Monitoring,” 
December 2010. 
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3. Countercyclical Buffer 

4. Leverage Ratio 

5. Liquidity Risk 

6. Counterparty Credit Risk 


The regulations are being implemented gradually between 2013 and 2019. 


16.2.1 Capital Definition and Requirements 
Under Basel HI, a bank’s total capital consists of: 


1. Tier 1 equity capital 
2. Additional Tier 1 capital 
3. Tier 2 capital 


There is no Tier 3 capital. 

Tier 1 equity capital (also referred to as common equity Tier 1, or CET1, capital) 
includes share capital and retained earnings but does not include goodwill or deferred tax 
assets. It must be adjusted downward to reflect defined benefit pension plan deficits but 
is not adjusted upward to reflect defined benefit plan surpluses. (See Section 3.12 for a 
discussion of defined benefit plans.) Changes in retained earnings arising from securitized 
transactions are not counted as part of capital for regulatory purposes. The same is true 
of changes in retained earnings arising from the bank’s own credit risk. (The latter is 
referred to as DVA and will be discussed in Chapter 20.) There are rules relating to the 
inclusion of minority interests and capital issued by consolidated subsidiaries. 

The additional Tier 1 (AT1) capital category consists of items, such as non- 
cumulative preferred stock, that were previously Tier 1 but are not common equity. 
Tier 2 capital includes debt that is subordinated to depositors with an original maturity 
of five years. 

Common equity is referred to by the Basel Committee as “going-concern capital.” 
When the bank is a going concern (1.e., has positive equity capital), common equity 
absorbs losses. Tier 2 capital is referred to as “gone-concern capital.’ When the bank 
is no longer a going concern (i.e., has negative capital), losses have to be absorbed by 
Tier 2 capital. Tier 2 capital ranks below depositors in a liquidation. While Tier 2 capital 
remains positive, depositors should in theory be repaid in full. 

The capital requirements are as follows: 


1. Tier 1 equity capital must be at least 4.5% of risk-weighted assets at all times. 

2. Total Tier 1 capital (Tier 1 equity capital plus additional Tier 1 capital) must be at 
6% of risk-weighted assets at all times. 

3. Total capital (total Tier 1 plus Tier 2) must be at least 8% of risk-weighted assets at 
all times. 


Basel I required Tier 1 equity capital to be at least 2% of risk-weighted assets and 
total Tier 1 capital to be at least 4% of risk-weighted assets. The Basel III rules are much 
more demanding because (a) these percentages have been increased and (b) the definition 
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of what qualifies as equity capital for regulatory purposes has been tightened. However, 
the Tier 1 plus Tier 2 requirement is the same as under Basel I and Basel II. 

The transitional arrangements were that Tier 1 equity capital and total Tier 1 capital 
must be 3.5% and 4.5%, respectively, by January 1, 2013. They must be 4% and 5.5%, 
respectively, by January 1, 2014. The new capital levels had to be in place by January 1, 
2015. The new rules for the definition of what constitutes capital were phased in over a 
longer period stretching until January 1, 2018. 

The Basel Committee also calls for more capital for “systemically important” banks, 
as will be discussed later in this chapter. 


16.2.2 Capital Conservation Buffer 


In addition to the capital requirements just mentioned, Basel III requires a capital con- 
servation buffer in normal times consisting of a further amount of CET1 capital equal 
to 2.5% of risk-weighted assets. This provision is designed to ensure that banks build up 
capital during normal times so that it can be run down when losses are incurred during 
periods of financial difficulties. (The argument in favor of this is that it is much easier 
for banks to raise capital during normal times than during periods of stressed market 
conditions.) In circumstances where the capital conservation buffer has been wholly or 
partially used up, banks are required to constrain their dividends until the capital has 
been replenished. The dividend rules are summarized in Table 16.2. For example, if Tier 
1 equity capital is 5.5% of risk-weighted assets, the minimum retained earnings is 80% 
so that the maximum dividends as a percent of retained earnings is 20%. The differ- 
ence between the 4.5% basic equity capital requirement and the 7.0% equity capital 
requirement that includes the capital conservation buffer is in some ways analogous to 
the difference between the MCR and SCR in Solvency II (see Section 15.12). 

The capital conservation buffer means that the Tier 1 equity capital that banks are 
required to keep in normal times (excluding other requirements that will be discussed 
later) is 7% of risk-weighted assets; total Tier 1 capital is required to be at least 8.5% 
of risk-weighted assets; Tier 1 plus Tier 2 capital is required to be at least 10.5% of 
risk-weighted assets. These numbers can decline to 4.5%, 6%, and 8% in stressed mar- 
ket conditions (because of losses), but banks are then under pressure to bring capital 
back up to the required levels. One of the consequences of the increased equity capital 


Table 16.2 Dividend Restrictions Arising from the 
Capital Conservation Buffer 


Minimum Percent of 
Tier 1 Equity Capital Ratio Earnings Retained 


4.000% to 5.125% 100% 
5.125% to 5.750% 80% 
5.750% to 6.375% 60% 
6.375% to 7.000% 40% 


>7% 0% 
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Table 16.3 Dividend Restrictions Arising from the Capital 
Conservation Buffer and 2.5% Countercyclical Buffer 


Minimum Percent of 


Tier 1 Equity Capital Ratio Earnings Retained 
4.50% to 5.75% 100% 
5.75% to 7.00% 80% 
7.00% to 8.25% 60% 
8.25% to 9.50% 40% 
>9.50% 0% 


requirement is that banks may find it difficult to achieve the returns on equity that they 
had during the 1990 to 2006 period. However, bank shareholders can console themselves 
that bank stock is less risky as a result of the extra capital. 

The capital conservation buffer requirement is being phased in between January 1, 
2016, and January 1, 2019. 


16.2.3 Countercyclical Buffer 


In addition to the capital conservation buffer, Basel III has specified a countercyclical 
buffer. This is similar to the capital conservation buffer, but the extent to which it is 
implemented in a particular country is left to the discretion of national authorities. The 
buffer is intended to provide protection for the cyclicality of bank earnings. The buffer 
can be set to between 0% and 2.5% of total risk-weighted assets and must be met with 
Tier 1 equity capital. 

For jurisdictions where the countercyclical buffer is non-zero, Table 16.2 is modified. 
For example, when the countercyclical buffer is set at its maximum level of 2.5%, it is 
replaced by Table 16.3. Like the capital conservation buffer, the countercyclical buffer 
requirements will be phased in between January 1, 2016, and January 1, 2019. 


16.2.4 Leverage Ratio 


In addition to the capital requirements based on risk-weighted assets, Basel III specifies 
a minimum leverage ratio of 3%.’ The leverage ratio is the ratio of a capital measure to 
an exposure measure. The capital measure is total Tier 1 capital. The exposure measure 
is the sum of (a) on-balance-sheet exposures, (b) derivatives exposures, (c) securities 
financing transaction exposures, and (d) offt-balance-sheet items. No risk-weighting 
adjustments are made. The on-balance-sheet exposures include all assets on the balance 
sheet. Derivatives exposures are calculated as “replacement cost plus add-on” similar 
to the way they are calculated under Basel I (see Section 15.3). Securities financing 
transaction exposures include transactions such as repurchase agreements and security 


7See Bank for International Settlements, “Basel III Leverage Ratio Framework and Disclosure 
Requirements,’ January 2014. 
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lending/borrowing when the transaction does not lead to balance sheet assets. Off- 
balance-sheet items include loan commitments, loan substitutes, acceptances, and letters 
of credit. The leverage ratio was introduced in January 2018 after a transition period. 

Regulators in some countries have indicated that they regard Basel’s leverage ratio of 
3% as too low. In April 2014, regulators in the United States proposed rules requiring a 
5% leverage ratio for eight large bank holding companies (which, as we will explain, are 
known as G-SIBs), and 6% for the FDIC-insured subsidiaries of these bank holding com- 
panies. In October 2014, the Bank of England’s Financial Policy Committee increased 
the leverage ratio for UK banks to 4.05%, a level that could be increased to 4.95% in 
boom times to rein in excessive lending. In China the leverage ratio has been set at 4%. 

Why did the Basel Committee introduce the leverage ratio? The reason is that reg- 
ulators thought that banks had too much discretion in the way risk-weighted assets were 
calculated. They have far less discretion in the way “total exposure” is calculated. It should 
be emphasized that this does not mean that regulators are discarding capital requirements 
based on risk-weighted assets. They require banks to satisfy both (a) the ratios of capital 
to risk-weighted assets mentioned earlier in this chapter and (b) the ratio of capital to 
non-risk-weighted exposure leverage requirement explained here. 

One of the two capital ratios is likely to be the critical ratio for a bank (1.e., the 
ratio it is closest to not complying with). If the risk-weighted assets ratio is the critical 
one, then arguably the leverage ratio can be justified because it provides useful additional 
information to regulators. But if the leverage ratio is the critical one (and there is some 
evidence that it has been for some U.S. banks), a bank might be encouraged to hold risky 
assets because they have the same effect on the leverage ratio as safe assets but provide a 
higher expected return. This could be an unintended adverse consequence for regulators. 


16.2.5 Liquidity Risk 


Prior to the crisis, the focus of the Basel regulations had been on ensuring that banks had 
sufficient capital for the risks they were taking. It turned out that many of the problems 
encountered by financial institutions during the crisis were not as a result of a shortage 
of capital. They were instead a result of liquidity risks taken by the banks. 

Liquidity risks arise because there is a tendency for banks to finance long-term needs 
with short-term funding, such as commercial paper. Provided the bank is perceived by 
the market to be financially healthy, this is usually not a problem Suppose that a bank 
uses 90-day commercial paper to fund its activities. When one 90-day issue of commer- 
cial paper matures, the bank refinances with a new issue; when the new issue matures, 
it refinances with another issue; and so on. However, as soon as the bank experiences 
financial difficulties—or is thought to be experiencing financial difficulties—it is liable 


Sf the funds are being used to finance long-term fixed-rate loans and interest rates rise, net interest 
margins are squeezed. But this risk can be hedged with instruments such as interest rate swaps (see 
Example 5.1). 
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to become impossible for the bank to roll over its commercial paper. This type of prob- 
lem led to the demise of Northern Rock in the United Kingdom and Lehman Brothers 
in the United States. 

Basel III has introduced requirements involving two liquidity ratios that are designed 
to ensure that banks can survive liquidity pressures. The ratios are: 


1. Liquidity Coverage Ratio (LCR); and 
2. Net Stable Funding Ratio (NSFR) 


The LCR focuses on a bank’s ability to survive a 30-day period of liquidity disrup- 
tions. It is defined as: 


High-Quality Liquid Assets 
Net Cash Outflows in a 30-Day Period 


The 30-day period considered in the calculation of this ratio is one of acute stress involv- 
ing a downgrade of the bank’s debt by three notches (e.g., from AA-— to A—),a partial loss 
of deposits, a complete loss of wholesale funding, increased haircuts on secured funding 
(so that instruments posted as collateral are not valued as highly), and drawdowns on lines 
of credit. The Basel III regulations require the ratio to be greater than 100% so that the 
bank’s liquid assets are sufficient to survive these pressures. 

The NSFR focuses on liquidity management over a period of one year. It is defined 
as 


Amount of Stable Funding 
Required Amount of Stable Funding 


The numerator is calculated by multiplying each category of funding (capital, wholesale 
deposits, retail deposits, etc.) by an available stable funding (ASF) factor, reflecting their 
stability. As shown in Table 16.4, the ASF for wholesale deposits is less than that for retail 
deposits, which in turn is smaller than that for Tier 1 or Tier 2 capital. The denominator 


Table 16.4 ASF Factors for Net Stable Funding Ratio 
ASF Factor Category 


100% Tier 1 and Tier 2 capital 
Preferred stock and borrowing with a remaining maturity greater than one year 

90% “Stable” demand deposits and term deposits with a remaining maturity less 
than one year provided by retail or small business customers 

80% “Less Stable” demand deposits and term deposits with a remaining maturity 
less than one year provided by retail or small business customers 

50% Wholesale demand deposits and term deposits with a remaining maturity less 
than one year provided by non-financial corporates, sovereigns, central 
banks, multilateral development banks, and public-sector entities 

0% All other liability and equity categories 
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Table 16.5 RSF Factors for Net Stable Funding Ratio 
RSF Factor Category 


0% Cash 
Short-term instruments, securities, loans to financial entities if they have a 
residual maturity of less than one year 


5% Marketable securities with a residual maturity greater than one year if they are 
claims on sovereign governments or similar bodies with a 0% risk weight 
20% Corporate bonds with a rating of AA— or higher and a residual maturity 


greater than one year 
Claims on sovereign governments or similar bodies with a risk weight of 20% 


50% Gold, equity securities, bonds rated A+ to A— 
65% Residential mortgages 
85% Loans to retail and small business customers with a remaining maturity less 
than one year 
100% All other assets 


is calculated from the items requiring funding. Each category of these is multiplied by a 
required stable funding (RSF) factor to reflect the permanence of the funding required. 
Some of the applicable factors are indicated in Table 16.5. 

Basel HI requires the NSFR to be greater than 100% so that the calculated amount 
of stable funding is greater than the calculated required amount of stable funding. 


Example 16.1 
A bank has the following balance sheet: 


Cash 5 Retail Deposits (stable) 40 
Treasury Bonds (> 1 yr) 5 Wholesale Deposits 48 
Mortgages 20 Tier 2 Capital 4 
Small Business Loans 60 Tier 1 Capital 8 
Fixed Assets 10 

100 100 


The Amount of Stable Funding is 
40 x 0.9 + 48 x 0.5 + 4 x 1.0 + 8 x 1.0 = 72 
The Required Amount of Stable Funding is 
5X 0+5 x 0.05 + 20 x 0.65 + 60 x 0.85 + 10 x 1.0 = 74.25 


The NSFR is therefore 


aca 0.970 
74.25 


or 97.0%. The bank does not therefore satisfy the NSFR requirement. 
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The new rules are tough and have the potential to dramatically change bank balance 
sheets. It was estimated in September 2014 that the rules will require U.S. banks to hold 
an extra $100 billion of liquid assets. The LCR requirement was scheduled by the Basel 
Committee to be implemented in stages between January 1, 2015, and January 1, 2019 
(but regulators in the United States required full implementation by January 1, 2017). 
The NSFR requirement was scheduled to be implemented on January 1, 2018. 


16.2.6 Counterparty Credit Risk 


For each of its derivatives counterparties, a bank calculates a quantity known as the credit 
value adjustment (CVA). This is the expected loss because of the possibility of a default by 
the counterparty. The way in which the calculation of CVA is carried out is described in 
Chapter 20. Reported profit is reduced by the total of the CVAs for all counterparties. 

As we will see in Chapter 20, the CVA for a counterparty can change because either 
(a) the risk factors underlying the value of the derivatives entered into with the counter- 
party change or (b) the credit spreads applicable to the counterparty’s borrowing change. 
Basel II requires the CVA risk arising from changing credit spreads to be a component 
of market risk capital. As will be explained in Chapter 20, once CVA has been calcu- 
lated, it is a relatively simple matter to calculate the delta and gamma with respect to a 
parallel shift in the term structure of the counterparty’s credit spread. These can be used 
to calculate appropriate market risk measures. 

The Basel Committee proposed changes to the calculation of capital for CVA risk in 
2015.°? A major objective was to allow banks that hedged the impact of both credit spread 
changes and risk factor changes on CVA to take both into account when calculating 
regulatory capital. Without this change, hedges for risk factor changes would increase 
capital charges. 


16.2.7 G-SIBs, SIFIs, and D-SIBs 


Regulators are particularly concerned that large, systemically important financial insti- 
tutions keep sufficient capital to avoid a repeat of the government bailouts during the 
2007 to 2009 credit crisis. 

The term G-SIB stands for global systemically important bank, whereas the term 
SIFI (systemically important financial institution) is used to describe both banks and 
nonbanks that are considered to be systemically important. The popular view of SIFIs is 
that they are “too big to fail,’ and have been identified as the financial institutions that 
will have to be bailed out if they run into financial difficulties. 

The systemic importance of a bank or other financial institution depends on the 
effect that its failure could have on the global financial system. This in turn depends on the 


?See Basel Committee on Banking Supervision, “Review of Credit Valuation Adjustment Risk 
Framework,’ October 1, 2015. 
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nature of its activities and the contracts it has entered into with other financial institutions 
globally. The Basel Committee uses a scoring methodology to determine which banks 
are G-SIBs. Other approaches involving network theory have been attempted by some 
researchers. 

In 2013, the Basel Committee published the final version of rules that call for more 
Tier 1 equity capital for G-SIBs.'° G-SIBs are categorized according to whether the 
extra equity capital is 1%, 1.5%, 2%, 2.5%, or 3.5% of risk-weighted assets. The new rules 
took effect in January 2016. 

A list of G-SIBs is published by the Financial Stability Board annually. In November 
2017, the list included 30 banks. Of these, 17 were in the 1% category, 8 were in the 1.5% 
category, 4 were in the 2% category, and 1 (JPMorgan Chase) was in the 2.5% category. 
None were in the 3.5% category. G-SIBs are required to keep CET1 capital equal to 
a baseline 4.5% of risk-weighted assets plus a further 2.5% for the capital conservation 
buffer plus any extra amounts required by national supervisors. Suppose the total is X% 
of risk-weighted assets. From Section 16.2.1, all Tier 1 capital (CET1 plus AT1) must be 
at least (X + 1.5)% of risk-weighted assets, and Tier 1 plus Tier 2 must be at least (X + 
3.5)% of risk-weighted assets. 

In November 2014, the Financial Stability Board issued proposals concerning the 
total loss-absorbing capacity (TLAC) of G-SIBs. The proposals were developed in con- 
sultation with the Basel Committee on Banking Supervision and were in response to 
a call by G20 leaders at the 2013 St. Petersburg Summit. TLAC consists of instruments 
that can absorb losses and protect depositors in the event of a default. The instruments 
include equity, preferred shares, and subordinated debt. From January 1, 2019, the mini- 
mum TLAC for G-SIBs will be 16% of risk-weighted assets, increasing to 18% on January 
1, 2022. Non-compliance with TLAC could impede a bank’s ability to issue dividends. 

Some countries are requiring greater capital than the minimums set by the Basel 
Committee and the Financial Stability Board or are implementing the rules earlier than 
required. An example of a country requiring that its banks hold high amounts of capital 
is Switzerland, which has two G-SIBs (UBS and Credit Suisse). The high capital require- 
ments are understandable because the banks are large in relation to the Swiss economy, 
so that a failure of either of them would be catastrophic. 

National regulators designate some banks that have not been classified as G-SIBs as 
domestic systemically important banks (D-SIBs). These banks may be subject to capital 
requirements higher than the minimum, extra disclosure requirements, or (see Chapter 
22) stringent stress tests. In the United States in 2016, eight banks (Bank of America, Bank 
of New York Mellon, Citigroup, Goldman Sachs, JPMorgan Chase, Morgan Stanley, State 
Street, and Wells Fargo) were designated as G-SIBs, and all other banks with assets over 
$50 billion were designated as D-SIBs. 


10 See Basel Committee on Banking Supervision, “Global Systemically Important Banks: Updated 
Assessment Methodology and the Higher Loss Absorbency Requirement,” July 2013. 
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16.3 Contingent Convertible Bonds 


An interesting development in the capitalization of banks has been what are known 
as contingent convertible bonds (CoCos). Traditionally, convertible bonds have been bonds 
issued by a company where, in certain circumstances, the holder can choose to convert 
them into equity at a predetermined exchange ratio. Typically the bond holder chooses 
to convert when the company is doing well and the stock price is high. CoCos are 
different in that they automatically get converted into equity when certain conditions 
are satisfied. Typically, these conditions are satisfied when the company is experiencing 
financial difficulties. 

CoCos are attractive to banks because in normal times the bonds are debt and allow 
the bank to report a relatively high return on equity. When the bank experiences financial 
difficulties and incurs losses, the bonds are converted into equity and the bank is able to 
continue to maintain an equity cushion and avoid insolvency. From the point of view 
of regulators, CoCos are potentially attractive because they avoid the need for a bailout. 
Indeed, the conversion of CoCos is sometimes referred to as a “bail-in.’ New equity 
for the financial institution is provided from within by private-sector bondholders rather 
than from outside by the public sector. 

A key issue in the design of CoCos is the specification of the trigger that forces 
conversion and the way that the exchange ratio (number of shares received in exchange 
for one bond) is set. A popular trigger in the bonds issued so far is the ratio of Tier 1 
equity capital to risk-weighted assets. Another possible trigger is the ratio of the market 
value of equity to book value of assets. 

Lloyd’s Banking Group, Rabobank Nederlands, and Credit Suisse were among 
the first banks to issue CoCos. Business Snapshot 16.1 provides a description of the 
bonds issued by Credit Suisse in 2011. These bonds get converted into equity if either 
Tier 1 equity capital falls below 7% of risk-weighted assets or the Swiss bank regu- 
lators determine that the bank requires public-sector support. It has been estimated 
that over $1 trillion of CoCos will be issued by banks during the decade begin- 
ning 2010 as they respond to the new Basel III regulatory requirements on capital 
adequacy. 

CoCos (prior to conversion) qualify as additional Tier 1 capital if the trigger, defined 
in terms of the ratio of Tier 1 equity capital to risk-weighted assets, is set at 5.125% or 
higher. Otherwise they qualify as Tier 2 capital. 


16.4 Use of Standardized Approaches and SA-CCR 


In December 2017, the Basel Committee announced that, starting in 2022, it would 
require the implementation ofa standardized approach for all capital calculations. A bank’s 
total capital requirement will be the maximum of (a) that calculated as before using 
approved internal models and (b) a certain percentage of that given by the standardized 
approaches. The percentage will 50% in 2022, rising to 72.5% in 2027. 
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BUSINESS SNAPSHOT 16.1 
Credit Suisse’s CoCo Bond Issues 


On February 14, 2011, Credit Suisse announced that it had agreed to exchange 
$6.2 billion of existing investments by two Middle Eastern investors, Qatar 
Holding LLC and the Olayan Group LLC, for CoCo bonds. The bonds automat- 
ically convert into equity if either of the following two conditions are satisfied: 


1. The Tier 1 equity capital of Credit Suisse falls below 7% of risk-weighted 
assets. 

2. The Swiss bank regulator determines that Credit Suisse requires public- 
sector support to prevent it from becoming insolvent. 


Credit Suisse followed this announcement on February 17, 2011, with a 
public issue of $2 billion of CoCos. These securities have similar terms to ones 
held to the Middle Eastern investors and were rated BBB+ by Fitch. They mature 
in 2041 and can be called any time after August 2015. The coupon is 7.875%. 
Any concerns that the market had no appetite for CoCos were alleviated by this 
issue. It was 11 times oversubscribed. 

Credit Suisse indicated that it plans to satisfy one-third of the non-equity 
capital requirement with bonds similar to those just described and two-thirds of 
the non-equity capital requirement with bonds where the conversion trigger is 
about 5% (rather than 7%) of risk-weighted assets. 


The standardized approach for market risk capital applicable in 2022 will be explained 
in Chapter 18. As explained in Chapter 23, operational risk capital will be based entirely 
on a standardized approach. The standardized risk weights for credit risk are a revision 
of the Basel II weights in Section 15.8.1. 

The CEM standardized approach for calculating exposure at default (EAD) 
for derivatives in Basel I has been improved with what is known as SA-CCR."! 
EAD is 


1.4 x (RC + PFE) 


where RC is the replacement cost and PFE is potential future exposure. For transactions 
where no collateral is posted, the RC is calculated as 


RC = max(V, 0) 


"See Basel Committee on Banking Supervision, “The Standardized Approach for Measuring Coun- 
terparty Credit Risk Exposures,” March 2014. 
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For transactions that are subject to margin agreements this is adjusted to 
RC = max(V — C, D, 0) 


where C is the net value of the collateral currently posted by the counterparty (with the 
collateral posted by the bank being considered negative). The variable D is the change 
in the V that could occur without any additional collateral being received (e.g., because 
of minimum transfer amounts in the collateral agreement). It reflects the terms of the 
collateral agreement. For example, if V = C = 50 and V could increase to 51 without 
any additional collateral being received, D = 1 and the RC = 1. In the case of non-cash 
collateral, a haircut is applied in the calculation of C. 

The PFE is a relatively sophisticated way of calculating the add-on to the current 
exposure. Two factors taken into account are: 


1. When the counterparty has posted excess collateral, risk is reduced and the reduction 
in risk increases as the amount of the excess collateral increases. 

2. When no collateral is posted, credit exposure reduces as V becomes more negative. 
The PFE add-on amount when V is slightly negative should be greater than when 
V is highly negative. 


16.5 Dodd—Frank Act 


The Dodd—Frank Act in the United States was signed into law in July 2010. Its aim is to 
prevent future bailouts of financial institutions and protect the consumer. A summary of 
the main regulations is as follows: 


1. Two new bodies, the Financial Stability Oversight Council (FSOC) and the 
Office of Financial Research (OFR), were created to monitor systemic risk and 
research the state of the economy. Their tasks are to identify risks to the finan- 
cial stability of the United States, promote market discipline, and maintain investor 
confidence. 

2. The orderly liquidation powers of the Federal Deposit Insurance Corporation 
(FDIC) were expanded. The Office of Thrift Supervision was eliminated. 

3. The amount of deposits insured by the FDIC was increased permanently to $250,000. 
(Previously, the $250,000 limit had been regarded as temporary.) 

4. Regulations were introduced requiring large hedge funds and similar financial inter- 
mediaries to register with the SEC and report on their activities. 

5. A Federal Insurance Office was created to monitor all aspects of the insurance indus- 
try and work with state regulators. 

6. Proprietary trading and other similar activities of deposit-taking institutions were 
curtailed. This is known as the “Volcker rule” because it was proposed by former 
Federal Reserve chairman Paul Volcker. 
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7. 


8. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


Some high-risk trading operations were required to be spun off into separately cap- 
italized affiliates. 

Standardized over-the-counter derivatives must be traded on electronic platforms 
known as swap execution facilities (SEFs). These are similar to exchanges and will, 
it is hoped, make prices in the OTC market more transparent. Standardized over- 
counter derivatives between financial institutions must be cleared by central clearing 
parties (CCPs). (See Chapter 17 for a further discussion of CCPs.) The Commodity 
Futures Trading Commission (CFTC) was given responsibility for monitoring the 
activities of CCPs and SEFs. 


. The Federal Reserve was required to set risk management standards for systemically 


important financial utilities engaged in activities such as payment, settlement, and 
clearing. 

Protection for investors was increased and improvements were made to the regulation 
of securities. 

Rating agencies were required to make the assumptions and methodologies behind 
their ratings more transparent and the potential legal liabilities of rating agencies were 
increased. An Office of Credit Ratings was created at the SEC to provide oversight 
of rating agencies. 

The use of external credit ratings in the regulation of financial institutions was dis- 
continued. (This provision of the Act brings Dodd—Frank into direct conflict with 
the Basel Committee, which, as we have seen in this chapter and the last one, does 
make some use of external credit ratings.) 

A Consumer Financial Protection Bureau was created within the Federal Reserve 
to ensure that consumers get clear and accurate information when they shop for 
financial products, such as mortgages and credit cards. 

Issuers of securitized products were required (with some exceptions) to keep 5% of 
each product created. 

Federal bank regulators were required to issue regulations that discourage the use 
of compensation arrangements that might lead to excessive risk taking (e.g., com- 
pensation arrangements based on short-run performance). Shareholders were given 
a non-binding vote on executive compensation. A requirement that board compen- 
sation committees be made up of independent directors was introduced. 

Mortgage lenders were required to make a reasonable good faith determination based 
on verified and documented information that the borrower has the ability to repay 
a loan. Failure to do this might lead to a foreclosure being disallowed. 

Large financial firms were required to have board committees where at least one 
expert has risk management experience at a large complex firm. 

The FDIC was allowed to take over a large financial institution when it failed and 
sell its assets, imposing losses on shareholders and creditors, with the costs of failures 
being paid for by the financial industry. 

FSOC and OFR, which as mentioned earlier have the responsibility of monitoring 
systemic risk, are charged with identifying systemically important financial institu- 
tions (SIFIs). 
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20. The Federal Reserve Board and the FDIC require all SIFIs to prepare what is known 
as living wills, mapping out how they could raise funds in a crisis and how their 
operations would be wound up in the event of failure. 


The Dodd—Frank Act did not define a future role for Fannie Mae and Freddie Mac, 
which were key players in the U.S. mortgage market. These agencies were taken over by 
the U.S. government in September 2008. 

President Trump has indicated his intention to dismantle some parts of Dodd-Frank. 
The Volcker rule and the Consumer Financial Protection Bureau are the most likely 
targets. 


16.6 Legislation in Other Countries 


The large banks are truly global and when regulations vary throughout the world, they 
are liable to move all or part of their operations from one jurisdiction to another to obtain 
more favorable treatment. Although all countries are subject to the same Basel III rules, 
local regulators have some discretion in the application of rules, and legislation is not the 
same across all countries. In 2011, the Swiss bank UBS made headlines by suggesting that 
it might move its investment bank headquarters from Zurich to London, Singapore, or 
New York to avoid the higher capital requirements imposed by Swiss regulators. (It did 
not do this.) 

The previous section outlined the rules introduced by legislators in the United States. 
Legislation in other countries has addressed some of the same issues. In the United 
Kingdom, an independent committee was chaired by Sir John Vickers to consider issues 
facing the banking industry, and new legislation in the form of the Financial Services 
(Banking Reform) Act 2013 was passed. In the European Union a committee headed 
by Erkki Liikanen was set up in November 2011 and published its report in October 
2012. 

Some of the rules and recommendations in the United Kingdom, European Union, 
and other countries are similar to those in the United States. For example, the Dodd- 
Frank Act requires originators of securitized products in the United States to keep 5% of 
all assets created. (See item 14 of the previous list.) A similar provision exists in the Capital 
Requirement Directive 2 (CRD2) of the European Union.'* Most national legislators 
have agreed that standardized over-the-counter derivatives should be cleared through 
central clearing houses instead of being cleared bilaterally (see list item 8). 

Sometimes the rules in different countries are similar but not exactly the same. The 
Dodd-—Frank Act included the requirement that standardized derivatives be traded on 
swap execution facilities (see list item 8). The European Union has introduced similar 
trading platforms, known as organized trading facilities (OTFs). However, it is not as 
dogmatic about requiring their use for all trading in standardized products. 


However, Germany has increased the 5% to 10%. 
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The most controversial aspect of Dodd—Frank is probably the Volcker rule (see item 
6), which prohibits banks from proprietary trading and restricts investment in hedge funds 
and private equity companies by banks and their affiliates. The rationale for the rule is 
reasonable: Banks should not be allowed to speculate with depositors’ funds, because those 
funds are insured by the FDIC. The rule may be difficult to enforce, though, because for 
a bank with a large portfolio of positions, it can be difficult to tell whether a particular 
new trade is entered into for hedging or speculative purposes.'? In spite of this, many U.S. 
banks have closed down their proprietary trading desks, and some employees working 
on those desks have left to join hedge funds. The Vickers committee in the United 
Kingdom recognized the importance of protecting depositors’ funds but did not go so 
far as to prohibit proprietary trading. The retail operations of banks must (by 2019) be 
ring-fenced so that they are insulated from riskier activities such as trading and investment 
banking. The Liikanen committee in the European Union similarly requires a separation 
of core banking activities from proprietary trading and other high-risk activities. 

Regulators in most countries consider living wills (see list item 20) to be important 
for SIFIs and are applying pressure on SIFIs to develop them. SIFIs tend to have devel- 
oped complex organizational structures for tax and regulatory purposes. The living will 
requirement may result in this being simplified so that the different activities of a SIFI 
are in separately capitalized legal entities, not all of which need to be bailed out in the 
event of the SIFI’s failure. Regulators have the option of forcing SIFIs to divest certain 
operations, or even break up entirely, if their living wills are deemed unsatisfactory. They 
hope that living wills avoid a replay of the panic and cross-border litigation that erupted 
when the Lehman Brothers investment bank collapsed in 2008. They also think that the 
multiyear litigation concerned with the termination of Lehman’s derivatives portfolio 
could have been avoided if Lehman and its ISDA master agreements had been structured 
so that it was possible to keep the portfolio alive and to unwind it in an orderly way in 
the few days after bankruptcy. 

Compensation is an important issue. Pre-crisis, the annual bonus was a large part of 
the compensation for many traders and other employees and led them to have a relatively 
short-term horizon in their decision making. If losses were incurred after the payment of 
a bonus, they did not have to return the bonus. Many banks have recognized the problem 
and voluntarily moved to bonuses that are deferred by being spread out over three to 
five years, rather than all being paid in one year. If a trader shows good results in one 
year and bad results in the next, some of the bonus applicable to the good year will be 
deferred and then “clawed back” during the bad year. The Dodd—Frank Act restrictions 
on pay in the financial sector are relatively mild. When financial institutions received 
funds during the crisis under the Troubled Asset Relief Program (TARP), compensation 
was restricted. But, as soon as the funds were paid back, banks had much more freedom 
in their compensation arrangements.'* 


13A joke circulating on Wall Street was that banks would have to employ psychologists or psychiatrists 
to determine what was going on in the mind of the trader at the time of a trade! 
'4Indeed, this was a major incentive to repay the TARP money as quickly as possible! 
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Some other countries have restricted compensation. Sometimes the restriction is 
temporary. For example, in 2009 the United Kingdom introduced a one-time “supertax” 
on bonuses in excess of £25,000. But in other cases it is permanent. For example, in April 
2013, the European Union introduced CRD 4, which imposes a cap on bankers’ bonuses. 
Under this directive the maximum ratio of bonus to fixed pay is 1.0 (with some flexibility 
to increase that ratio to 2.0 with shareholder approval). 


Summary 


The financial crisis that started in 2007 was the worst that many parts of the world 
had seen since the 1930s. Some financial institutions failed. Others had to be bailed out 
with taxpayers’ money. Not surprisingly, both the Basel Committee and national gov- 
ernments decided that a major overhaul of the regulations affecting financial institutions 
was required. 

The Basel II.5 regulations increased the capital banks were required to keep for 
market risk. They recognized that capital should reflect the volatilities and correlations 
experienced during stressed market conditions as well as during normal market condi- 
tions; they eliminated some of the ways banks could reduce regulatory capital by moving 
items from the banking book to the trading book; and they created a special capital 
requirement for derivatives dependent on credit correlation, which had been a particular 
problem during the crisis. 

Basel III dramatically increased the amount of equity capital banks were required 
to keep. It also recognized that many of the problems of banks during the crisis were 
liquidity problems and imposed new liquidity requirements for financial institutions. 

National governments have also introduced new rules for financial institutions. In 
the United States, the Dodd—Frank Act has many provisions designed to protect con- 
sumers and investors, avoid future bailouts, and monitor the functioning of the financial 
system more carefully. Similar, though not necessarily identical, regulations exist in other 
countries. 

Exactly how Basel III and national legislation such as Dodd—Frank will be imple- 
mented is still somewhat uncertain—and this uncertainty is one of the major risks that 
banks face. How successful will the measures be once they have been implemented? We 
will not know this for some time. One problem facing regulators is what are referred to 
as unintended consequences. Basel I had the unintended consequence of discouraging loans 
to high-quality corporations because of the 100% risk weight that would be assigned. 
The 1996 Amendment and the development of the credit derivatives market that came 
after it encouraged banks to find ways of moving credit risks from the banking book to 
the trading book in order to reduce capital requirements. There will no doubt be unin- 
tended consequences of Basel III and the legislation that is being introduced throughout 
the world. Hopefully, the benefits of the new measures will outweigh any harm to the 
financial system arising from the unintended consequences. 
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Practice Questions and Problems 
(Answers at End of Book) 


16.1 What are the three major components of Basel II.5? 

16.2 What are the six major components of Basel III? 

16.3 What is the difference between VaR as it has been traditionally measured and 
stressed VaR? 

16.4 Explain how the incremental risk charge is calculated. Why was it introduced by 
the Basel Committee? 

16.5 What is the difference between the capital required for a AAA-rated ABS with 
principal of $100 million and a AAA-rated ABS CDO with a principal of $100 
million using the standardized approach? 

16.6 By how much has the Tier 1 equity capital (including the capital conservation 
buffer) increased under Basel III compared with the Tier 1 equity capital require- 
ment under Basel I and II? 

16.7 Suppose that the Tier 1 equity ratio for a bank is 6%. What is the maximum 
dividend, as a percent of earnings, that can be paid if (a) there is no countercyclical 
buffer and (b) there is a 2.5% countercyclical buffer? 

16.8 Explain how the leverage ratio differs from the usual capital ratios calculated by 
regulators. 

16.9 Explain how the liquidity coverage ratio and the net stable funding ratio are 
defined. 

16.10 How would the net stable funding ratio in Example 16.1 change if half the whole- 
sale deposits were replaced by stable retail deposits? 

16.11 What is CVA? What new regulations concerning CVA were introduced in Basel 
Ill? 

16.12 Explain how CoCo bonds work. Why are they attractive to (a) banks and (b) 
regulators? 
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16.13 Explain one way that the Dodd—Frank Act is in conflict with (a) the Basel 
international regulations and (b) the regulations introduced by other national 


16.14 


Further Questions 


governments. 


A bank has the following balance sheet: 


Cash 3 
Treasury Bonds (>1 yr) 5 
Corporate Bonds Rated A 4 
Mortgages 18 
Small Business Loans (<1 yr) 60 
Fixed Assets 10 

100 


Retail Deposits (stable) 25 
Retail Deposits (less stable) 15 
Wholesale Deposits 44 
Preferred Stock (>1 yr) 4 
Tier 2 Capital 3 
Tier 1 Capital 9 

100 


(a) What is the Net Stable Funding Ratio? 


(b) The bank decides to satisfy Basel III by raising more (stable) retail deposits and 
keeping the proceeds in Treasury bonds. What extra retail deposits need to be 


raised? 


Chapter 17 


Regulation of the OTC 
Derivatives Market 


n Chapter 5, we distinguished between the over-the-counter (OTC) and exchange- 
[=< markets. The exchange-traded market is a market where products devel- 

oped by an exchange are bought and sold on a trading platform developed by the 
exchange. A market participant’s trade must be cleared by a member of the exchange 
clearing house. The exchange clearing house requires margin (i.e., collateral) from its 
members, and the members require margin from the brokers whose trades they are clear- 
ing. The brokers in turn require margin from their clients. 

The OTC market is a market where financial institutions, fund managers, and cor- 
porate treasurers deal directly with each other. An exchange is not involved. Before the 
2007-2008 credit crisis, the OTC market was largely unregulated. Two market par- 
ticipants could enter into any trade they liked. They could agree to post collateral or 
not post collateral. They could agree to clear the trade directly with each other or 
use a third party. Also, they were under no obligation to disclose details of the trade to 
anyone else. 

Since the crisis, the OTC market has been subject to a great deal of regulation. This 
chapter will explain the regulations and show that regulatory pressure is leading to the 
OTC market becoming more like the exchange-traded market. 
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Figure 17.1 Bilateral and Central Clearing 


17.1 Clearing in OTC Markets 


We start by describing how transactions are cleared in the OTC market. There are two 
main approaches: central clearing and bilateral clearing. They are illustrated schematically 
in Figure 17.1 (which makes the simplifying assumption that there are only eight market 
participants and only one CCP). In bilateral clearing, market participants clear transac- 
tions with each other. In central clearing, a third party, known as a central counterparty 
(CCP), clears the transactions. 


17.1.1 Margin 


Before proceeding to describe bilateral and central clearing in more detail, we review 
the operation of margin accounts. Margin is the word now used to describe the collateral 
posted in OTC markets as well as exchange-traded markets. 

Variation margin 1s the collateral posted to reflect the change in the value of a deriva- 
tives portfolio. Consider the situation where Party A is trading with Party B and the 
collateral agreement states that variation margin (with no threshold or minimum trans- 
fer amount) has to be posted by both sides.’ This means that, if the value of outstanding 
transactions changes during a day so that they increase in value by $X to A (and there- 
fore decrease in value by $X to B), B has to provide A with $X of acceptable collateral. 
The cumulative effect of variation margin is that, if outstanding derivatives have a value 
of +$V to A and -$V to B at a particular time, B should have posted a total of $V of 
collateral with A by that time? 

Variation margin provides some protection against a counterparty default. It would 
provide total protection in an ideal world where (a) the counterparty never owes any 
variation margin at the time of default and (b) all outstanding positions can be replaced 


1A and B could be two derivatives dealers or a derivatives dealer and one of its clients. Also, one of 
A and B could be a CCP. A threshold is a minimum value of the portfolio to one side before it can 
demand margin, and the minimum transfer amount is the minimum change in value necessary for a 
margin to have to be posted. 

2In this context, note that if A buys an option from B for $10,000, it must pay $10,000 to B, but B 
must then return the $10,000 to A as variation margin. 
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at mid-market prices as soon as the counterparty defaults. In practice, defaulting coun- 
terparties often stop posting collateral several days before they default, and the non- 
defaulting counterparty is usually subject to a bid-offer spread as it replaces transactions? 
To allow for adverse movements in the value of the portfolio during a period prior to 
default when no margin is being posted, market participants sometimes require initial 
margin in addition to variation margin. Note that, in this context, adverse market move- 
ments are increases in the value of the portfolio to the non-defaulting party, not decreases. 
This is because increases in the value during a period when variation margin is not being 
posted lead to increases in replacement costs. Initial margin, which can change through 
time as the outstanding portfolio and relevant volatilities change, reflects the risk of a loss 
due to adverse market moves and the costs of replacing transactions. 

Most margin is cash, but the agreements in place may specify that securities can be 
posted instead of cash. The securities may be subject to a haircut. This means that the 
market value of the securities is reduced to determine their value for margin purposes. 
For example, a Treasury bond might be subject to a 10% haircut, indicating that, if its 
market value were $100, it would cover only $90 of a margin requirement. 

Should cash margin earn interest? There is a difference between futures markets and 
OTC markets here. A futures exchange clearing house requires both initial margin and 
variation margin from members. Members earn interest on the initial margin. But they do 
not do so on variation margin because futures contracts are settled daily so that variation 
margin does not belong to the member posting it. In the case of OTC trades, interest is 
usually earned on all cash margin posted because trades are not settled daily. 


17.1.2 Central Clearing 


In central clearing, a central counterparty (CCP) handles the clearing. A CCP operates 
very much like an exchange clearing house. When two companies, A and B, agree to an 
over-the-counter derivatives transaction and decide to clear it centrally, they present it 
to a CCP. Assuming that the CCP accepts it, the CCP acts as an intermediary and enters 
into offsetting transactions with the two companies. 

Suppose, for example, that the transaction is an interest rate swap where company A 
pays a fixed rate of 5% to company B on a principal of $100 million for five years and 
company B pays LIBOR to company A on the same principal for the same period of 
time. Two separate transactions are created. Company A has a transaction with the CCP 


3 As explained later, the non-defaulting counterparty is able to claim from the defaulting party the 
costs related to the bid-offer spread that it would incur in replacing the transaction. 

“Tt may seem strange that a market participant would be worried about the value of its transactions 
increasing. But suppose a transaction with a defaulting counterparty is hedged with another transaction 
entered into with another counterparty (as is often the case). The transaction with the other party can 
be expected to lose value without any compensating gain on the defaulted transaction. 

5 As indicated earlier, the non-defaulting party is allowed to keep all margin posted by the defaulting 
party up to the amount that can be legitimately claimed. 
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Figure 17.2 Role of CCP in OTC Markets 


where it pays 5% and receives LIBOR on $100 million. Company B has a transaction with 
the CCP where it pays LIBOR and receives 5% on $100 million. The two companies 
no longer have credit exposure to each other. This is illustrated in Figure 17.2. If one or 
both parties to the transaction are not members of the CCP, they can clear the transaction 
through members. 

Three large CCPs are 


1. SwapClear (part of LCH Clearnet in London), 
2. ClearPort (part of the CME Group in Chicago), and 
3. ICE Clear Credit (part of the Intercontinental Exchange). 


A CCP requires its members to provide initial margin and variation margin for the 
transactions being cleared. Typically, the initial margin is calculated so that there is a 99% 
probability that it will cover market moves over five days. This protects the CCP from 
losses as it tries to close out or replace the positions of a defaulting member. 

Consider the swap in Figure 17.2.Suppose for simplicity that it is the only transaction 
each side has with the CCP. The CCP might require an initial margin of $0.5 million 
from each side. If, on the first day, interest rates fall so that the value of the swap to A 
goes down by $100,000, Party A would be required to pay a variation margin equal to 
this to the CCP, and the CCP would be required to pay the same amount to B. There 
could also be a change to the initial margin requirements determined by the CCP. If 
required margin is not paid by one of its members, the CCP closes out its transactions 
with that member. Cash and Treasury instruments are usually accepted as margin by 
CCPs. Typically the interest rate paid on cash balances is close to the overnight federal 
funds rate for U.S. dollars (and close to similar overnight rates for other currencies). 

In practice, market participants are likely to have multiple transactions outstanding 
with the CCP at any given time. The initial margin required from a participant at any 
given time reflects the volatility of the value of its total position with the CCP. The role 
of a CCP in the OTC market is similar to the role of a clearing house in the exchange- 
traded market. The main difference is that transactions handled by the CCP are usually 
less standard than transactions in the exchange-traded market so that the calculation of 
margin requirements is more complicated. 
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The key advantage of clearing a transaction through a CCP is that OTC market 
participants do not need to worry about the creditworthiness of the counterparties they 
trade with. Credit risk is handled by the CCP using initial and variation margin. 

ACCP requires its members to contribute to a default fund. (As mentioned, if one or 
both parties to a transaction are not members of the CCP, they can clear the transaction 
through members. They will then have to post margin with the members.) Ifa member 
fails to post margin when required, the member is in default and its positions are closed 
out. In closing out a member’s positions, the CCP may incur a loss. A waterfall defines 
who bears the loss. The order in which the loss is funded is usually as follows: 


1. The initial margin of the defaulting member 

2. The default fund contribution of the member 

3. The default fund contributions of other members 
4. The equity of the CCP® 


This is similar to the way losses in the event of a default are funded by an exchange 
clearing house. 


17.1.3 Bilateral Clearing 


In bilateral clearing, each pair of market participants enters into an agreement describing 
how all future transactions between them will be cleared. Typically this is an ISDA master 
agreement. (ISDA is short for International Swaps and Derivatives Association.) An annex 
to the agreement, known as the credit support annex (CSA), defines collateral arrangements. 
In particular, it defines what collateral (if any) has to be posted by each side, what assets 
are acceptable as collateral, what haircuts will be applied, and so on. The main body of the 
agreement defines what happens when one side defaults (e.g., by declaring bankruptcy, 
failing to make payments on the derivatives as they are due, or failing to post collateral 
when required). We will discuss this in more detail shortly. 


17.1.4 Netting 


We discussed netting in connection with the Basel I rules in Section 15.5. Netting is a 
feature of ISDA master agreements and a feature of the agreements between CCPs and 
their members. It states that all transactions between two parties are considered to be a 
single transaction when (a) collateral requirements are being calculated and (b) early ter- 
minations occur because of a default. As explained in Section 15.5, netting reduces credit 
risk because it means that the defaulting party cannot choose to default on transactions 
that are out-of-the-money while keeping transactions that are in-the-money. 


In some cases, the non-defaulting members are required to provide additional default fund contri- 
butions when there is a default, with a cap on the amount of these additional contributions. (This is 
true of both exchange clearing houses and CCPs.) 
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Netting can also save initial margin. Suppose Party A has two transactions with a 
CCP that are not perfectly correlated. The initial margin for the portfolio is likely to be 
less than that for the two transactions separately. 


17.1.5 Events of Default 


Derivatives transactions are treated differently from other transactions in the event that a 
market participant fails to meet its obligations. For example, in ISDA master agreements 
there is an early termination provision that takes precedence over bankruptcy rules. This 
states that, if there is an “event of default,” the non-defaulting party has the right to ter- 
minate all transactions with the defaulting party after a short period of time has elapsed.’ 
Events of default include declarations of bankruptcy, failure to make payments as they are 
due, and failure to post collateral when required. Non-derivative contracts cannot always 
be terminated in this way. Another important difference between derivatives transactions 
and non-derivatives transactions is that in the case of derivatives transactions the non- 
defaulting party can take immediate possession of any collateral that has been posted by 
the defaulting party. It does not have to get a court order to allow it to do this. 

If there is an event of default under an ISDA master agreement, the non-defaulting 
party calculates the mid-market value of outstanding transactions. It then adjusts this 
valuation in its favor by half the bid-offer spreads on the transactions for the purposes of 
calculating a settlement amount. This adjustment is compensation for the fact that it will 
have to trade with other dealers to replace the transactions and it will be subject to their 
bid-offer spreads when it does so. Suppose that one transaction has a mid-market value 
of $20 million to the non-defaulting party and that the transaction is bid $18 million, 
offer $22 million. For the purposes of settlement, the transaction would be valued at 
$22 million because this is what it would cost the non-defaulting party to replace the 
defaulting party’s position in the transaction. If the non-defaulting party had the other 
side of the transaction so that its mid-market value were —$20 million, it would be valued 
at —$18 million for settlement purposes. In this case, the assumption is that a third party 
would be prepared to pay only $18 million to take the defaulting party’s position. 


17.2 Post-Crisis Regulatory Changes 


The OTC derivatives market was considered by many to have been partly responsible 
for the 2008 credit crisis. When the G20 leaders met in Pittsburgh in September 2009 
in the aftermath of the 2008 crisis, they wanted to reduce systemic risk by regulating 


7 The non-defaulting party is not obliged to terminate transactions. Counterparties that are out-of- 
the-money sometimes consider that it is in their best interests not to terminate. 

8 Failure resolution mechanisms have been proposed where transactions are stayed (i.e., not terminated) 
for a period of time even if there is a bankruptcy filing, provided margin/collateral continues to be 
posted. These would allow the derivatives portfolios of bankrupt market participants to be unwound 
in an orderly way. 
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the OTC market. The statement issued by the leaders after the meeting included the 
following paragraph: 


All standardized OTC derivative contracts should be traded on exchanges or electronic 
trading platforms, where appropriate, and cleared through central counterparties by end- 
2012 at the latest. OTC derivative contracts should be reported to trade repositories. Non- 
centrally cleared contracts should be subject to higher capital requirements. We ask the FSB 
and its relevant members to assess regularly implementation and whether it is sufficient to 
improve transparency in the derivatives markets, mitigate systemic risk, and protect against 
market abuse. 


The results of this were three major changes affecting OTC derivatives: 


1. A requirement that all standardized OTC derivatives be cleared through CCPs. Stan- 
dardized derivatives include plain vanilla interest rate swaps (which account for the 
majority of OTC derivatives traded) and default swaps on credit indices. The pur- 
pose of this requirement is to reduce systemic risk (see Business Snapshot 15.1). It 
leads to derivatives dealers having less credit exposure to each other so that their 
interconnectedness is less likely to lead to a collapse of the financial system. 

2. A requirement that standardized OTC derivatives be traded on electronic platforms. 
This is to improve transparency. The thinking is that, if there is an electronic plat- 
form for matching buyers and sellers, the prices at which products trade should be 
readily available to all market participants? The platforms are called swap execution 
facilities (SEFs) in the United States and organized trading facilities (OTFs) in Europe. 
In practice, standardized products once they have been traded on these platforms are 
passed automatically to a CCP. 

3. A requirement that all trades in the OTC market be reported to a central trade 
repository. This requirement provides regulators with important information on the 
risks being taken by participants in the OTC market. It is partly a response to the AIG 
fiasco where regulators were not aware of the huge risks being taken by a subsidiary 
of AIG until the insurance company asked to be bailed out. 


The first two of these requirements apply only to transactions between two finan- 
cial institutions (or between a financial institution and a non-financial company that is 
considered to be systemically important because of the volume of its OTC derivatives 
trading). Derivatives dealers can therefore continue to trade with many of their non- 
financial corporate clients in the same way that they did pre-crisis. 

About 25% of OTC transactions were cleared through CCPs pre-crisis and the 
remaining 75% were cleared bilaterally. As a result of the new rules, these percentages 
have flipped so that approximately 75% of OTC transactions are now cleared through 
CCPs while 25% are cleared bilaterally. 


? An issue here is that the type of electronic platform that is appropriate for swaps may not be the same 
as the one that is used by exchanges. Swaps are traded intermittently with large notional principals. 
Futures and options on an exchange trade continually and the size of trades is usually much smaller. 
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17.2.1 Uncleared Trades 


Following another G20 meeting in 2011, the rules have been tightened for non-standard 
OTC derivatives. These are the derivatives that are not covered by the rules just men- 
tioned. They are cleared bilaterally rather than centrally and are referred to as uncleared 
trades. Regulations, which are being implemented between 2016 and 2020, require 
uncleared trades between two financial institutions (or between a financial institution 
and a non-financial company that is considered to be systemically important) to be sub- 
ject to rules on the margin that has to be posted. Previously, one of the attractions of 
bilateral clearing was that market participants were free to negotiate any credit support 
annex to their ISDA master agreements. 

The rules state that both initial margin and variation margin must be posted for 
uncleared trades by both sides. Variation margin was fairly common in the OTC market 
pre-crisis (particularly in trades between derivatives dealers), but initial margin was rare. 
When entering into a transaction with a much less creditworthy counterparty, a deriva- 
tives dealer might insist on the counterparty posting initial margin. But the posting of 
initial margin by both sides was almost unheard of in the bilaterally cleared market. 

Variation margin is usually transmitted directly from one counterparty to the other. 
Initial margin when posted by both sides cannot be handled in this way. If, for example, 
A transmitted $1 million of initial margin to B and B transmitted $1 million of initial 
margin to A, the initial margin would not serve the desired purpose because the transfers 
would cancel each other. For this reason the regulations require initial margin to be 
transmitted to a third party, where it is held in trust. 


17.2.2 Determination of Initial Margin: SIMM 


For the new rules on uncleared transactions to work, the two sides to an ISDA master 
agreement must agree on the variation margin and initial margin. The variation margin 
requires agreement on the valuation of outstanding transactions, and procedures have 
been established for resolving any disagreements on this. The calculation of initial margin 
is more complicated than valuing the transactions and there is more scope for different 
models to give different results. As a result there have been attempts to develop an industry 
standard. 

Initial margin is specified in the regulations for portfolios of uncleared transactions 
between two parties as the gain in value over 10 days that we are 99% certain will 
not be exceeded in stressed market conditions. Note that initial margin is the mirror 
image of VaR. When we are calculating VaR, we are determining extreme percentiles 
of the loss distribution, but when we are calculating initial margin we are determining 
extreme percentiles of the gain distribution. This is because exposure increases as the 
uncollateralized value of a portfolio increases. 

The Basel Committee proposed a grid approach for calculating initial margin, which 
specified initial margin as a percentage of notional principal for different types of transac- 
tions. This was unpopular because it did not incorporate netting. If a market participant 
entered into a certain transaction on Day 1 and an almost offsetting transaction on Day 
5, both with the same counterparty, the initial margin on Day 5 would be almost double 
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that on Day 1—even though the net exposure to the counterparty would be close to 
zero. ISDA proposed what is known as the Standard Initial Margin Model (SIMM) as a 
way of overcoming this. This model has now been approved by regulators. 

SIMM uses the model-building approach described in Chapter 14.'° Delta and vega 
risks are handled using the weighted sensitivities and risk weights as in equation (14.8) 
so that 


IM (Delta and Vega) = 


where the W, is the risk weight for risk factor i (specified by the regulators), 6; is the 
sensitivity of the position held to risk factor i (determined by the bank), and p; is the 
correlation between risk factors i and j (specified by the regulators). Because a 10-day 
time horizon with 99% confidence is used, a possible formula for W, is 


W, = V10 x N7!(0.99)o; (17.1) 


where 6, is the daily volatility (or standard deviation, in the case of interest rates, credit 
spreads, and volatilities) of the ith risk factor in stressed market conditions. 

To calculate the incremental effect on initial margin of gamma risk, SIMM first con- 
siders the situation where all deltas are zero and there is no cross gamma. From equations 
(14.11) and (14.12), the mean and standard deviation of the change in the value of the 
portfolio over one day are: 


E(AP) = = Dy vo 


SD(AP) = 


1 
LY phasis 
ij 


where y; is the gamma with respect to the ith risk factor. 
Estimates of the mean and standard deviation of portfolio change over 10 days are 


obtained by replacing 6; with y 100,. Defining 


C; = sil V 106)? 


the mean, m, and standard deviation, s, of the 10-day change are therefore given by 


10 See “ISDA SIMM: From Principles to Model Specification,” ISDA, March 3, 2016. 
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Figure 17.3 Relation between À and f 


SIMM then sets 
IM (Gamma) = m + as/V2 


The parameter A in this equation is (see Problem 17.14) defined in terms of 


Li C, 
Xi IC; 


P 


as indicated in Figure 17.3. This relationship produces results that have the right prop- 
erties and correspond closely with tests carried out using Monte Carlo simulation. 

There are a number of other details in SIMM. To simplify matters, gamma is calcu- 
lated from vega using the relationship between the two that holds for European options. 
Risk factors are divided into buckets, and some risk factors involve term structures with 
vertices and are handled as we have described in Chapter 14. There are rules specified 
for calculating the correlations p; both within buckets and between buckets. 


17.3 Impact of the Changes 


The new regulations have led to a world where more collateral is required for OTC 
derivatives transactions. Pre-crisis, most OTC transactions were cleared bilaterally and 
an initial margin was usually not required. Under the new regulations, most transactions 
will be cleared through CCPs where both initial and variation margin will be required 
from both sides. Furthermore, transactions that are cleared bilaterally between financial 
institutions will require even more collateral than they would if they could be cleared 
through CCPs. 
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Figure 17.4 Example Where There Are Three Market Participants, One CCP, and Two Product 
Types 
One product type (represented by dotted lines) can be cleared; the other (represented by solid lines) 


cannot. 


As discussed by Duffie and Zhu, there is one potential partial offset to the huge 
increase in collateral requirements mandated by the new rules.'! Under central clearing 
there is the potential for more netting. In Figure 17.1, under bilateral clearing a market 
participant has many different netting sets, one for each of the other market participants. 
Under central clearing, there is only one netting set. Bank A can, for example, net its 
transactions where Bank B is the counterparty with its transactions where Bank C is the 
counterparty, provided that all go through the same CCP. 

Figure 17.1, however, is a simplification. It suggests that the choice is between a 
100% bilateral world and a world where all transactions are cleared through a single 
CCP. The reality is that (a) there will be a number of CCPs and it is quite likely that they 
will not cooperate with each other to reduce initial margin requirements, and (b) some 
transactions will continue to be cleared bilaterally; so banks will face a situation that is a 
mixture of the two worlds depicted in Figure 17.1. 

It is even possible that the new rules requiring the use of CCPs could reduce rather 
than increase netting in some cases. This is illustrated by Figure 17.4, which shows 
the situation where there are three market participants and one CCP. The exposures 
represented by the dotted lines are standard transactions that can be cleared centrally. 
Those represented by the solid lines are nonstandard transactions that cannot be cleared 


"See D. Duffie and H. Zhu, “Does a Central Counterparty Reduce Counterparty Risk?” Review of 
Asset Pricing Studies 1 (2011): 74-95. 
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centrally. For example, in B’s dealings with A, the nonstandard transactions are worth 100 
to B and —100 to A; the standard transactions are worth +50 to A and —50 to B. 

Without central clearing, the average exposure before collateral of the three parties is 
+40. With central clearing, the average exposure is 110 when the exposure to the CCP is 
included and 70 when it is not. Central clearing is likely to increase the collateral market 
participants have to post in this simple situation. This happens because without the central 
clearing rules standard transactions can be netted with nonstandard transactions, but with 
the central clearing rules this is no longer possible. 

Most experts think that there will be an increase in netting, but the overall effect of 
the changes will be an increase in margin requirements. Pre-crisis, relatively few OTC 
derivatives attracted initial margin. Post-crisis, the vast majority of OTC derivatives will 
require initial margin. A related consideration is that, as more transactions are cleared 
through CCPs, more of the funds of a financial institution will be tied up in default fund 
contributions. 


17.3.1 Liquidity 


Most of the collateral required under the new regulations will have to be in the form of 
cash or government securities. An increasingly important consideration for all derivatives 
market participants is therefore liquidity. Not only will the collateral posted at any given 
time be a drain on liquidity, but banks will have to keep a sufficient quantity of liquid assets 
on hand to ensure that they are able to meet any margin calls. (Margin calls from a CCP 
have to be met almost immediately.) As we saw in Chapter 16, Basel III has recognized 
the importance of liquidity by proposing two new liquidity ratios that banks must adhere 
to. Capital has in the past been the key metric in determining the profitability of different 
business units and different projects at a bank. In the future, a two-dimensional metric 
involving capital and liquidity is likely to be used. Often there will be a trade-off between 
capital and liquidity in that a project will look attractive from a capital perspective and 
unattractive from a liquidity perspective, or vice versa. 


17.3.2 Rehypothecation 


Liquidity pressures are likely to increase because of another post-crisis change. What is 
known as “rehypothecation” was common in some jurisdictions (particularly the United 
Kingdom) pre-crisis. (See Business Snapshot 17.1.) It involved a dealer using collateral 
posted with it by one counterparty to satisfy a collateral demand by another counterparty. 
It is estimated that pre-crisis about $4 trillion of collateral was required in derivatives mar- 
kets, but that because of rehypothecation only $1 trillion of new collateral was posted.!? 
In other words, each item of collateral was used on average four times. Rehypothe- 
cation will be restricted under new rules developed by the Basel Committee and the 


!2See M. Singh and J. Aitken, “The (Sizable) Role of Rehypothecation in the Shadow Banking Sys- 
tem,’ Working Paper, International Monetary Fund, 2010. 
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BUSINESS SNAPSHOT 17.1 
Rehypothecation 


A practice in the management of collateral known as rehypothecation can cause 
problems. If Party A posts collateral with Party B and rehypothecation is permit- 
ted, Party B can use the same collateral to satisfy a demand for collateral from 
Party C; Party C can then the use the collateral to satisfy a demand for collateral 
from Party D;and so on. In 2007, it was estimated that U.S. banks had more than 
$4 trillion of collateral, but that this was created by using $1 trillion of original 
collateral in conjunction with rehypothecation. Rehypothecation was particu- 
larly common in the United Kingdom, where title to collateral is transferred. 
After Lehman declared bankruptcy in September 2008, clients (particularly 
European hedge fund clients) found it difficult to get a return of the collateral 
they had posted with Lehman because it had been rehypothecated. As a result of 
this experience, many market participants are more cautious than they used to 
be, and clauses in CSAs banning or limiting rehypothecation are now common. 


International Organization of Securities Commissions (IOSCO). These rules allow initial 
margin to be rehypothecated once, but only if certain conditions are satisfied. Variation 
margin can be rehypothecated. But increasingly dealers themselves impose restrictions 
on rehypothecation because they do not want to be disadvantaged in the same way that 
some of Lehman’s counterparties were (see Business Snapshot 17.1). 


17.3.3 The Convergence of OTC and Exchange-Traded Markets 


The developments we have been discussing are blurring the distinction between OTC 
derivatives and exchange-traded derivatives. Many OTC transactions are now traded on 
platforms similar to exchanges and cleared through organizations similar to exchange 
clearing houses. As time goes by, more OTC transactions are likely to be classified as 
“standard” so that the percentage of OTC transactions handled similarly to exchange- 
traded transactions will increase. What is more, even those OTC transactions between 
financial institutions that are cleared bilaterally may begin to look more like exchange- 
traded transactions. This is because margin has to be posted with a third party, and we 
can expect organizations (somewhat similar to exchange clearing houses) to be set up to 
facilitate this. 

It is also the case that exchanges are increasingly trying to offer less standard prod- 
ucts to institutional investors in an attempt to take business away from the OTC mar- 
ket. As a result, while OTC markets are moving in the direction of becoming more 
like exchange-traded markets, exchange-traded markets are moving in the opposite 
direction and becoming more like OTC markets. Many CCPs and exchanges have a 
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common ownership and will find areas for cooperation on margin requirements and 
business practices. Whether a transaction is being cleared through an exchange or a CCP 
may not be important in the future because it will be handled in the same way by the 
same organization. 


17.4 CCPs and Bankruptcy 


The key objective of regulators is to reduce systemic risk. Some commentators have 
criticized the new derivatives regulations as replacing too-big-to-fail banks by too-big- 
to-fail CCPs. It certainly would be a disaster for the financial system if a major CCP such 
as LCH Clearnet’s SwapClear and CME’s ClearPort were to fail. In theory, as described 
in Hull (2012), it is possible to design the contract between CCPs and their members so 
that it is virtually impossible for a CCP to fail.’ In practice, it is considered important 
that a CCP has “skin in the game.” It is then motivated to take good decisions with 
respect to key issues such as whether a new member should be admitted, how initial 
margins should be set, and so on. 

The main reason why it makes sense to replace too-big-to-fail banks by too-big-to- 
fail CCPs is that CCPs are much simpler organizations than banks. They are therefore 
much simpler to regulate than banks. In essence, regulators need ensure only that the CCP 
follows good practices in (a) choosing members, (b) valuing transactions, and (c) deter- 
mining initial margins and default fund contributions. In the case of banks, a myriad of 
different, much more complex activities must be monitored. It is of course important for 
regulators to ensure that CCPs are not allowed to become more complex organizations 
by expanding outside their core activity of intermediating derivatives transactions. 


Summary 


Prior to the 2007—2008 credit crisis, the over-the-counter (OTC) derivatives market 
was largely unregulated. Two market participants could agree to any transaction they 
liked and then reach any agreement they liked on how the transaction would be cleared. 
They were also free to choose any arrangements they liked for the posting of collateral. 
This is no longer the case. The OTC derivatives market is now subject to a great deal 
of regulation throughout the world. The extent to which the OTC derivatives market 
should be blamed for the crisis is debatable, but post-crisis regulatory changes are having 
more effect on this market than on almost any other sector of the economy. 

Most standard OTC derivatives between two financial institutions must be cleared 
through central counterparties. These are very similar to exchanges. They require ini- 
tial margin and variation margin to be posted by both sides. Nonstandard transactions 


'3See J. Hull, “CCPs, Their Risks, and How They Can Be Reduced,” Journal of Derivatives 20, no. 1 
(Fall 2012): 26-29. 
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between financial institutions will continue to be cleared bilaterally, but are subject to 
regulation on the collateral that must be posted. Specifically, transactions between finan- 
cial institutions are subject to initial margin (segregated) and variation margin (transferred 
from one side to the other when the value of outstanding transactions changes). 

What will the derivatives world look like in 15 or 20 years? Present trends indicate 
that there will be a convergence between OTC and exchange-traded markets, and the 
distinction between the two will become blurred. But it should be acknowledged that 
there is no certainty that this trend will continue. The OTC market as it existed before 
the crisis was very profitable for a few large banks. It is possible that they will chip away 
at the regulations so that they are able eventually to find a way of creating a new OTC 
market somewhat similar to the one that existed before the crisis. A battle is likely to 
take place pitting the determination of regulators against the ingenuity of banks. 
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Practice Questions and Problems 
(Answers at End of Book) 


17.1 What is the difference between using an ISDA master agreement and using a CCP 
for clearing OTC transactions? 

17.2 Explain the new regulations introduced since the 2007—2008 crisis that (a) require 
certain transactions to be cleared using CCPs and (b) require extra collateral for 
some transactions cleared bilaterally. 

17.3 Why might the regulations introduced since the 2007—2008 crisis create liquidity 
problems for some financial institutions? 

17.4 What is meant by a haircut in a collateral agreement? 

17.5 Explain the meaning of “event of default” and an “early termination” in an ISDA 
master agreement. 

17.6 In Figure 17.4 where the CCP is used, suppose that half of the transactions 
between A and B that are represented by the solid line are moved to the CCP. What 
effect does this have on (a) the average exposure of the three parties including 
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17.7 


17.8 


17.9 
17.10 
17.11 


17.12 


17.13 


17.14 


their exposures to the CCP and (b) the average exposure of the three parties 
excluding their exposures to the CCP? 

What credit risks is a company taking when it becomes a member of a CCP and 
clears transactions through the CCP? 

“Netting affects the collateral that has to be posted and the settlement in the event 
of an early termination.” Explain. 

What is rehypothecation? 

Why are CCPs easier to regulate than banks? 

What is claimed by the non-defaulting party in an early termination under an 
ISDA master agreement? 


Further Questions 


“Some aspects of the new regulations can be expected to increase the amount of 
collateral posted for derivatives, and some can be expected to reduce it.” Explain 
this statement. 

In Figure 17.4 where the CCP is used, suppose that an extra transaction between 

A and C that is worth 140 to A is cleared bilaterally. What effect does this have 

on the tables in Figure 17.4? 

Consider a delta-neutral position in a single asset with a gamma (measured with 

respect to percentage changes in the asset) of y (y > 0). Suppose that the 10-day 

return on the asset is normally distributed with a mean of zero and a standard 
deviation ©. 

(a) What is an expression for the change in the portfolio value, AP, over 10 days 
as a function of y,o,and a random sample from a standard normal distribution. 

(b) The square of a standard normal has a chi-squared distribution with one degree 
of freedom. Write the expression for AP in (a) as a function of a random sample 
from such a chi-squared distribution and C, where C = yo?/2. 

(c) The 99 percentile of a chi-squared distribution with one degree of freedom 
is 6.63. (See CHISQ.INV in Excel.) Use this to show that the 99 percentile 
of AP is 6.63C. 

(d) The mean and variance of a chi-squared distribution with one degree of free- 
dom are 1 and 2. Show that this result is consistent with Figure 17.3 and the 
formula for IM (Gamma) in Section 17.2.2. 

(e) Show that when y < 0 the 99 percentile of AP is less than zero so that Fig- 
ure 17.3 then gives IM (Gamma) = 0. 


Chapter 18 


Fundamental Review 


of the Trading Book 


n May 2012, the Basel Committee on Banking Supervision issued a consultative 

document proposing major revisions to the way regulatory capital for market risk 

is calculated. This is referred to as the “Fundamental Review of the Trading Book” 
(FRTB).! The Basel Committee then followed its usual process of requesting comments 
from banks, revising the proposals, and carrying out Quantitative Impact Studies (QISs).? 
The final version of the rules was published by the Basel Committee in January 20162 
This require banks to implement the new rules in 2019, but in December 2017 the 
implemented year was revised to 2022. 

FRTB’s approach to determining capital for market risk is much more complex than 
the approaches previously used by regulators. The purpose of this chapter is to outline 
its main features. 


'See Bank for International Settlements, “Consultative Document: Fundamental Review of the Trad- 
ing Book,” May 2012. 

? QISs are calculations carried out by banks to estimate the impact of proposed regulatory changes on 
capital requirements. 

3 See Bank for International Settlements, “Minimum Capital Requirements for Market Risk,” January 
2016. 
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18.1 Background 


The Basel I calculations of market risk capital were based on a value at risk (VaR) calcu- 
lated for a 10-day horizon with a 99% confidence level. The VaR was “current” in the 
sense that calculations made on a particular day were based on the behavior of market 
variables during an immediately preceding period of time (typically, one to four years). 
Basel II.5 required banks to calculate a “stressed VaR” measure in addition to the cur- 
rent measure. As explained in Sections 13.1 and 16.1, this is VaR where calculations are 
based on the behavior of market variables during a 250-day period of stressed market 
conditions. To determine the stressed period, banks were required to go back through 
time searching for a 250-day period where the observed movements in market variables 
would lead to significant financial stress for the current portfolio. 

FRTB changes the measure used for determining market risk capital. Instead of VaR 
with a 99% confidence level, it uses expected shortfall (ES) with a 97.5% confidence 
level. The measure is actually stressed ES with a 97.5% confidence. This means that, as 
in the case of stressed VaR, calculations are based on the way market variables have been 
observed to move during stressed market conditions. 

For normal distributions, VaR. with a 99% confidence and ES with a 97.5% con- 
fidence are almost exactly the same. Suppose losses have a normal distribution with a 
mean p and standard deviation o. The 99% VaR is u + 2.3260 while the 97.5% expected 
shortfall is u + 2.33804 (See Problem 18.2.) For non-normal distributions, they are not 
equivalent. When the loss distribution has a heavier tail than a normal distribution, the 
97.5% ES can be considerably greater than the 99% VaR. 

Under FRTB, the 10-day time horizon used in Basel I and Basel II.5 is changed to 
reflect the liquidity of the market variable being considered. FRTB considers changes 
to market variables that would take place (in stressed market conditions) over periods of 
time reflecting their liquidity. The changes are referred to as shocks. The market variables 
are referred to as risk factors. The periods of time considered are referred to as liquidity 
horizons. Five different liquidity horizons are specified: 10 days, 20 days, 40 days, 60 days, 
and 120 days. The allocation of risk factors to these liquidity horizons is indicated in 
Table 18.1. 

FRTB specifies both a standardized approach and an internal models approach for 
calculating market risk capital. Even when banks have been approved to use the inter- 
nal models approach, they are required by regulators to calculate required capital under 
both approaches. This is consistent with the Basel Committee’s plans to use standard- 
ized approaches to provide a floor for capital requirements. As discussed in Section 16.4, 
in December 2017, the Basel Committee announced a move to a situation where total 
required capital is at least 72.5% of that given by standardized approaches. It will achieve 


‘From equation (12.2), the ES for a normal distribution with mean p and standard deviation 


o is p+ oexp(—Y*/2)/[1/2n(1 — X)] where X is the confidence level and Y is the point on a 
normal distribution that has a probability of 1 — X of being exceeded. This can also be written 
u + of (VaR)/(1 — X) where f is the probability density function for the loss. 
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Table 18.1 Allocation of Risk Factors to Liquidity Horizons 


Risk Factor Horizon (days) 
Interest rate (dependent on currency) 10—60 
Interest rate volatility 60 
Credit spread: sovereign, investment grade 20 
Credit spread: sovereign, non-investment grade 40 
Credit spread: corporate, investment grade 40 
Credit spread: corporate, non-investment grade 60 
Credit spread: other 120 
Credit spread volatility 120 
Equity price: large cap 10 
Equity price: small cap 20 
Equity price: large cap volatility 20 
Equity price: small cap volatility 60 
Equity: other 60 
Foreign exchange rate (dependent on currency) 10—40 
Foreign exchange volatility 40 
Energy price 20 
Precious metal price 20 
Other commodities price 60 
Energy price volatility 60 
Precious metal volatility 60 
Other commodities price volatility 120 
Commodity (other) 120 


this by 2027 with a five-year phase-in period. These changes are a culmination of a trend 
by the Basel Committee since the 2008 crisis to place less reliance on internal models 
and to use standardized models to provide a floor for capital requirements. 

A difference between FRTB and previous market risk regulatory requirements is that 
most calculations are carried out at the trading desk level. Furthermore, permission to use 
the internal models approach is granted on a desk-by-desk basis. Therefore it is possible 
that, at a particular point in time, a bank’s foreign currency trading desk has permission 
to use the internal models approach while the equity trading desk does not. 

In earlier chapters, we saw how the ways in which capital is calculated for the trading 
book and the banking book are quite different. This potentially gives rise to regulatory 
arbitrage where banks choose to allocate instruments to either the trading book or the 
banking book so as to minimize capital. In Basel II.5, the incremental risk charge made 
this less attractive. FRTB counteracts regulatory arbitrage by defining more clearly than 
previously the differences between the two books. 


18.2 Standardized Approach 


Under the standardized approach, the capital requirement is the sum of three components: 
a risk charge calculated using a risk sensitivity approach, a default risk charge, and a 
residual risk add-on. 
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Consider the first component. Seven risk classes (corresponding to trading desks) are 
defined (general interest rate risk, foreign exchange risk, commodity risk, equity risk, and 
three categories of credit spread risk). Within each risk class, a delta risk charge, vega risk 
charge, and curvature risk charge are calculated. 

The delta risk charge for a risk class is calculated using the risk weights and weighted 
sensitivity approach described in Section 14.6: 


Risk Charge = 5 > p80 W; W, (18.1) 


1 ah 


where the summations are taken over all risk factors in the risk class. The risk weights, W,, 
and the correlations between risk factors, Pip are determined by the Basel Committee? 
The weighted sensitivities (or deltas), 5;, are determined by the bank. As explained in 
Chapter 14, in the case of risk factors such as equity prices, exchange rates, or commod- 
ity prices, the deltas measure the sensitivity of the portfolio to percentage changes. For 
example, if a 1% increase in a commodity price would increase the value of a portfolio 
by $3,000, the delta would be 3,000/0.01 = 300,000. In the case of risk factors such 
as interest rates and credit spreads, the deltas are defined in terms of absolute changes. 
For example, if the effect of an interest rate increasing by one basis point (0.0001) is to 
reduce the value of a portfolio by $200, the delta with respect to that interest rate would 
be —200/0.0001 = —2,000,000. 

Consider how the risk weights, W, might be set by regulators. Suppose first that 
all risk factors are equity prices, exchange rates, or commodity prices, so the deltas are 
sensitivities to percentage changes. If W; were set equal to the daily volatility of risk factor 
i for all i, the risk charge in equation (18.1) would equal the standard deviation of change 
in the value of the portfolio per day. This corresponds to the Markowitz result discussed 
in Chapters 1 and 14. If W; were set equal to the daily volatility of risk factor i in stressed 
market conditions (the stressed daily volatility) for all i, equation (18.1) would give the 
standard deviation of the daily change of the portfolio in stressed market conditions. In 
practice, the W, are set equal to multiples of the stressed daily volatility to reflect the 
liquidity horizon and the confidence level that regulators wish to consider. Suppose that 
the stressed daily volatility of risk factor i is estimated as 2% and that the risk factor 


has a 20-day liquidity horizon. The risk weight might be set as 0.02 x 4/20 X 2.338 = 
0.209. (Note that the 2.338 multiplier reflects the amount by which a standard deviation 
has to be multiplied to get ES with a 97.5% confidence when a normal distribution is 
assumed.) 

Now suppose that the risk factors are interest rates and credit spreads so that deltas are 
sensitivities with respect to actual changes measured in basis points. The W, for risk factor 
i is set equal to a multiple of the stressed daily standard deviation for all i. If the multiple 
were 1, the formula would give the standard deviation of the value of the portfolio in 


> Banks are required to test the effect of multiplying the correlations specified by the Basel Committee 
by 1.25, 1.00, and 0.75 and then set the capital charge equal to the greatest result obtained. 
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one day. In practice the multiple is determined as just described to reflect the liquidity 
horizon and confidence level. 

Vega risk is handled similarly to delta risk.° A vega risk charge is calculated for each 
risk class using equation (18.1). The risk factors (counted by the i and j) are now volatili- 
ties. The summation is taken over all volatilities in the risk class. The parameter ô; is actu- 
ally a vega. It is the sensitivity of the value of the portfolio to small changes in volatility 
i.’ The parameter P; is the correlation between changes in volatility í and volatility j, and 
W, is the risk weight for volatility i. The latter is determined similarly to the delta risk 
weights to reflect the volatility of the volatility i, its liquidity horizon, and the confidence 
level. 

There are assumed to be no diversification benefits between risk factors in different 
risk classes and between the vega risks and delta risks within a risk class. The end product 
of the calculations we have described so far is therefore the sum of the delta risk charges 
across the seven risk classes plus the sum of the vega risk charges across the seven risk 
classes. 


18.2.1 Term Structures 


In the case of risk factors such as interest rates, volatilities, and credit spreads, there is 
usually a term structure defined by a number of points. For example, an interest rate 
term structure is typically defined by 10 points. These are the zero-coupon interest rates 
for maturities of 3 months, 6 months, 1 year, 2 years, 3 years, 5 years, 10 years, 15 years, 
20 years, and 30 years. As discussed in Chapter 14, each vertex of the term structure is 
a separate risk factor for the purposes of using equation (18.1). The delta of a portfolio 
with respect to a one basis point move in one of the vertices on the term structure is 
calculated by increasing the position of the vertex by one basis point while making no 
change to the other vertices. The Basel Committee defines risk weights for each vertex 
of the term structure and correlations between the vertices of the same term structure. 

A simplification is used when correlations between points on different term struc- 
tures are defined. The correlations between point A on term structure 1 and point B 
on term structure 2 are assumed to be the same for all A and B. This was one of the 
alternatives considered in Chapter 14; see equation (14.6). 


18.2.2 Curvature Risk Charge 


The curvature risk charge is a capital charge for a bank’s gamma risk exposure under 
the standardized approach. Consider the exposure of a portfolio to the ith risk factor. 
Banks are required to test the effect of increasing and decreasing the risk factor by its 
risk weight, W,. If the portfolio is linearly dependent on the risk factor, the impact of 


® As discussed in Chapter 14, this works well because most of the value of a derivative is in many cases 
approximately linearly dependent on volatility. 
7Banks can choose whether it is percentage or actual changes in volatility that are considered. 
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(a) Portfolio value (b) 


Portfolio 
value 


Risk factor i Risk factor i 


Figure 18.1 Calculation of Curvature Risk Charge for a Risk Factor 
In Figure 18.1a, the curvature risk charge is AB; in Figure 18.1b, it is zero. 


an increase of W, in the risk factor is W,6;. Similarly, the impact of a decrease of W; in 
the risk factor is —6,W;. To evaluate the impact of curvature net of the delta effect, the 
standardized approach therefore calculates 


1. W,6; minus the impact of a increase of W, in the risk factor, and 
2. —W ð; minus the impact of a decrease in the risk factor of W;,. 


The curvature risk charge for the risk factor is the greater of these two. If the impact of 
curvature net of delta is negative, it is counted as zero. The calculation is illustrated in 
Figure 18.1. In Figure 18.1a, the portfolio value is currently given by point O. If there 
were no curvature, an increase of W, in the risk factor would lead to the portfolio value 
at point C, whereas a decrease of W; in the risk factor would lead to the portfolio value 
at point A. Because of curvature, an increase of W, leads to the portfolio value at point 
D, and a decrease of W; leads to the portfolio value at point B. Since AB > CD, the risk 
charge is AB. In Figure 18.1b, the risk charge is zero because curvature actually increases 
the value of the position (relative to what delta would suggest) for both increases and 
decreases in the risk factor. (Figure 18.1a could correspond to a short position in an 
option; Figure 18.1b could correspond to a long position in an option.) 

When there are several risk factors, each is handled similarly to Figure 18.1. When 
there is a term structure (e.g., for interest rates, credit spreads, and volatilities), all points 
are shifted by the same amount for the purpose of calculating the effect of curvature. 
The shift is the largest W; for the points on the term structure. In the case of an interest 
rate term structure, the W, corresponding to the three-month vertex is often the largest 
W,, so this would define an upward and downward parallel shift in the term structure. 
The delta effect is removed for each point on the term structure by using the ô; for that 
point. 

The curvature risk charges for different risk factors are combined to determine a total 
curvature risk charge. When diversification benefits are allowed, aggregation formulas 
broadly similar to those used for deltas are used with correlations specified by the Basel 
Committee. 
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18.2.3 Default Risk Charge 


Risks associated with counterparty credit spread changes are handled separately from 
risks associated with counterparty defaults in FRTB. In the standardized approach, 
credit spread risks are handled using the delta/vega/curvature approach described earlier. 
Default risks, sometimes referred to as jump-to-default (JTD) risks, are handled by a sep- 
arate default risk charge. This is calculated by multiplying each exposure by a loss given 
default (LGD) and a default risk weight. Both the LGD and the risk weight are specified 
by the Basel Committee. For example, the LGD for senior debt is specified as 75% and 
the default risk for a counterparty rated A is 3%. Equity positions are subject to a default 
risk charge with an LGD = 100%. Rules for offsetting exposures are specified. 


18.2.4 Residual Risk Add-On 


The residual risk add-on considers risks that cannot be handled by the delta/ 
vega/curvature approach described earlier. It includes exotic options when they cannot 
be considered as linear combinations of plain vanilla options. The add-on is calculated 
by multiplying the notional amount of the transaction by a risk weight that is specified 
by the Basel Committee. In the case of exotic options the risk weight is 1%. 


18.2.5 A Simplified Approach 


In this section, we have described the standardized approach that the Basel Committee 
requires all large banks to use. It is worth noting that in June 2017 the Basel Committee 
published a consultative document outlining a simplified standardized approach that it 
proposes for smaller banks The full approach is simplified in a number of ways. For 
example, vega and gamma risk do not have to be considered. This should make FRTB 
more attractive to jurisdictions such as the United States that have many small banks that 
tend to enter into only relatively simple transactions. 


18.3 Internal Models Approach 


The internal models approach requires banks to estimate stressed ES with a 97.5% con- 
fidence. FRTB does not prescribe a particular method for doing this. Typically the his- 
torical simulation approach described in Chapter 13 is likely to be used. Risk factors are 
allocated to liquidity horizons as indicated in Table 18.1. Define: 


Category 1 Risk Factors: Risk factors with a time horizon of 10 days 
Category 2 Risk Factors: Risk factors with a time horizon of 20 days 
Category 3 Risk Factors: Risk factors with a time horizon of 40 days 


8 See Basel Committee on Banking Supervision, “Simplified Alternative to the Standardized Approach 
to Market Risk Capital Requirements,” June 2017. 
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Category 4 Risk Factors: Risk factors with a time horizon of 60 days 
Category 5 Risk Factors: Risk factors with a time horizon of 120 days 


As we shall see, all calculations are based on considering 10-day changes in the risk 
factors. In Basel I and Basel II.5, banks are allowed to deduce the impact of 10-day changes 
from the impact of one-day changes using a v10 multiplier. In FRTB, banks are required 
to consider changes over periods of 10 days that occurred during a stressed period in the 
past. Econometricians naturally prefer that non-overlapping periods be used when VaR 
or ES is being estimated using historical simulation, because they want observations on 
the losses to be independent. However, this is not feasible when 10-day changes are 
considered, because it would require a very long historical period. FRTB requires banks 
to base their estimates on overlapping 10-day periods. The first simulation trial assumes 
that the percentage changes in all risk factors over the next 10 days will be the same as 
their changes between Day 0 and Day 10 of the stressed period; the second simulation 
trial assumes that the percentage changes in all risk factors over the next 10 days will be 
the same as their changes between Day 1 and Day 11 of the stressed period; and so on. 

Banks are first required to calculate ES when 10-day changes are made to all risk 
factors. (We will denote this by ES,.) They are then required to calculate ES when 10- 
day changes are made to all risk factors in categories 2 and above with risk factors in 
category 1 being kept constant. (We will denote this by ES,.) They are then required to 
calculate ES when 10-day changes are made to all risk factors in categories 3, 4, and 5 
with risk factors in categories 1 and 2 being kept constant. (We will denote this by ES;.) 
They are then required to calculate ES when 10-day changes are made to all risk factors 
in categories 4 and 5 with risk factors in categories 1, 2, and 3 being kept constant. (We 
will denote this by ES,.) Finally, they are required to calculate ES;, which is the effect of 
making 10-day changes only to category 5 risk factors. 

The liquidity-adjusted ES is calculated as 


(18.2) 


where LH; is the liquidity horizon for category j. To understand equation (18.2), suppose 
first that all risk factors are in category 1 or 2 so that only ES, and ES, are calculated. 
It is assumed that the behavior of all risk factors during a 10-day period is independent 
of the behavior of category 2 risk factors during a further 10-day period. An extension 
of the square root rule discussed in Chapter 13 then leads to the liquidity-adjusted ES 


being 
4/ ES} + ES} 


Now suppose that there are also category 3 risk factors. The expression 4/ ES; + ES 
would be correct if the category 3 risk factors had a 20-day instead of a 40-day 
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liquidity horizon. We assume that the behavior of the category 3 risk factors over an 
additional 20 days is independent of the behavior of all the risk factors over the periods 
already considered. We also assume that the ES for the category 3 risk factors over 20 


days is V2 times their ES over 10 days. This leads to a liquidity-adjusted ES of: 


4/ ES} + ES + 2ES2 


Continuing in this way, we obtain equation (18.2). This is referred to as the cascade 
approach to calculating ES (and can be used for VaR as well). 

Calculations are carried out for each desk. If there are six desks, this means the inter- 
nal models approach, as we have described it so far, requires 5 X 6 = 30 ES calculations. 
As mentioned, the use of overlapping time periods is less than ideal because changes in 
successive historical simulation trials are not independent. This does not bias the results, 
but it reduces the effective sample size, making results more noisy than they would oth- 
erwise be. 

FRTB represents a movement away from basing calculations on one-day changes. 
Presumably the Basel Committee has decided that, in spite of the lack of independence 
of observations, a measure calculated from 10-day changes provides more relevant infor- 
mation than a measure calculated from one-day changes. This could be the case if changes 
on successive days are not independent, but changes in successive 10-day periods can rea- 
sonably be assumed to be independent. 

The calculation of a stressed measure (VaR or ES) requires banks to search for the 
period in the past when market variable changes would be worst for their current port- 
folio. (The search must go back as far as 2007.) When Basel II.5 was implemented, a 
problem was encountered in that banks found that historical data were not available for 
some of their current risk factors. It was therefore not possible to know how these risk 
factors would have behaved during the 250-day periods in the past that were candidates 
for the reference stressed period. FRTB handles this by allowing the search for stressed 
periods to involve a subset of risk factors, provided that at least 75% of the current risk 
factors are used. The expected shortfalls that are calculated are scaled up by the ratio 
of ES for the most recent 12 months using all risk factors to ES for the most recent 
12 months using the subset of risk factors. (This potentially doubles the number of ES 
calculations from 30 to 60.) 

Banks are required to calculate ES for the whole portfolio as well for each of six 
trading desks. The ES for a trading desk is referred to as a partial expected shortfall. It is 
determined by shocking the risk factors belonging to the trading desk while keeping all 
other risk factors fixed. The sum of the partial expected shortfalls is always greater than 
the ES for the whole portfolio. What we will refer to as the weighted expected shortfall 
(WES) is a weighted average of (a) the ES for the whole portfolio and (b) the sum of the 
partial expected shortfalls. Specifically: 


WES =A xX EST + (1 — A) x È, ESP, 
J 
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where EST is the expected shortfall calculated for the total portfolio and ESP; is the jth 
partial expected shortfall. The parameter A is set by the Basel Committee to be 0.5. 
Some risk factors are categorized as non-modelable. Specifically, if there are less than 
24 observations on a risk factor in a year or more than one month between successive 
observations, the risk factor is classified as non-modelable. Such risk factors are handled 
by special rules involving stress tests. 
The total capital requirement for day t is 


max(WES,_, +NMC,_,,m, X WES, + NMC 


avg aie) 


where WES,_, is the WES for day t— 1, NMC,_, is the capital charge calculated for 
non-modelable risk factors on day t — 1, WES 
60 days, and NMC 
factors over the previous 60 days. The parameter m, is at minimum 1.5. 


is the average WES for the previous 


avg 


avg 18 the average capital charge calculated for the non-modelable risk 


18.3.1 Back-Testing 


FRTB does not back-test the stressed ES measures that are used to calculate capital under 
the internal models approach for two reasons. First, it is more difficult to back-test ES 
than VaR. Second, it is not possible to back-test a stressed measure at all. The stressed data 
upon which a stressed measure is based are extreme data that statistically speaking are not 
expected to be observed with the same frequency in the future as they were during the 
stressed period. 

FRTB back-tests a bank’s models by asking each trading desk to back-test a VaR 
measure calculated over a one-day horizon and the most recent 12 months of data. Both 
99% and 97.5% confidence levels are to be used. If there are more than 12 exceptions 
for the 99% VaR or more than 30 exceptions for the 97.5% VaR, the trading desk is 
required to calculate capital using the standardized approach until neither of these two 
conditions continues to exist. 

Banks may be asked by regulators to carry out other back-tests. Some of these could 
involve calculating the p-value of the profit or loss on each day. This is the probability 
of observing a profit that is less than the actual profit or a loss that is greater than the 
actual loss. If the model is working perfectly, the p-values obtained should be uniformly 
distributed. 


18.3.2 Profit and Loss Attribution 


Another test used by the regulators is known as profit and loss attribution. Banks are required 
to compare the actual profit or loss in a day with that predicted by their models. Two 
measures must be calculated. The measures are: 


Mean of U 
Standard Deviation of V 


Variance of U 
Variance of V 
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where U denotes the difference between the actual and model profit/loss in a day and V 
denotes the actual profit/loss in a day? Regulators expect the first measure to be between 
—10% and +10% and the second measure to be less than 20%. When there are four or 
more situations in a 12-month period where the ratios are outside these ranges, the desk 
must use the standardized approach for determining capital. 


18.3.3 Credit Risk 


As mentioned, FRTB distinguishes two types of credit risk exposure to a company: 


1. Credit spread risk is the risk that the company’s credit spread will change, causing the 
mark-to-market value of the instrument to change. 
2. Jump-to-default risk is the risk that there will be a default by the company. 


Under the internal models approach, the credit spread risk is handled in a similar way 
to other market risks. Table 18.1 shows that the liquidity horizon for credit spread varies 
from 20 to 120 days and the liquidity horizon for a credit spread volatility is 120 days. 
The jump-to-default risk is handled in the same way as default risks in the banking book. 
In the internal models approach, the capital charge is based on a VaR calculation with a 
one-year time horizon and a 99.9% confidence level. 


18.3.4 Securitizations 


The comprehensive risk measure (CRM) charge was introduced in Basel II.5 to cover the 
risks in products created by securitizations such as asset-backed securities and collateral- 
ized debt obligations (see Section 16.1). The CRM rules allow a bank (with regulatory 
approval) to use its own models. The Basel Committee has concluded that this is unsatis- 
factory because there is too much variation in the capital charges calculated by different 
banks for the same portfolio. It has therefore decided that under FRTB the standardized 
approach must be used for securitizations. 


18.4 Trading Book vs. Banking Book 


The FRTB addresses whether instruments should be put in the trading book or the 
banking book. Roughly speaking, the trading book consists of instruments that the bank 
intends to trade. The banking book consists of instruments that are expected to be held 
to maturity. Instruments in the banking book are subject to credit risk capital whereas 
those in the trading book are subject to market risk capital. The two sorts of capital are 
calculated in quite different ways. This has in the past given rise to regulatory arbitrage. 
For example, as we have mentioned in earlier chapters, banks have often chosen to hold 


? The “actual” profit/loss should be the profit and loss that would occur if there had been no trading 
in a day. This is sometimes referred to as the hypothetical profit and loss. 
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credit-dependent instruments in the trading book because they are then subject to less 
regulatory capital than they would be if they had been placed in the banking book. 

The FRTB attempts to make the distinction between the trading book and the 
banking book clearer and less subjective. To be in the trading book, it will no longer 
be sufficient for a bank to have an “intent to trade.” It must be able to trade and manage 
the underlying risks on a trading desk. The day-to-day changes in value should affect 
equity and pose risks to solvency. The FRTB provides rules for determining for different 
types of instruments whether they should be placed in the trading book or the banking 
book. 

An important point is that instruments are assigned to the banking book or the trad- 
ing book when they are initiated and there are strict rules preventing them from being 
subsequently moved between the two books. Transfers from one book to another can 
happen only in extraordinary circumstances. (Examples given of extraordinary circum- 
stances are the closing of trading desks and a change in accounting standards with regard 
to the recognition of fair value.) Any capital benefit as a result of moving items between 


the books will be disallowed. 


Summary 


FRTB is a major change to the way capital is calculated for market risk. After 20 years 
of using VaR with a 10-day time horizon and 99% confidence to determine market 
risk capital, regulators are switching to using ES with a 97.5% confidence level and 
varying time horizons. The time horizons, which can be as high as 120 days, are designed 
to incorporate liquidity considerations into the capital calculations. The change that is 
considered to a risk factor when capital is calculated reflects movements in the risk factor 
over a period of time equal to the liquidity horizon in stressed market conditions. 

The Basel Committee has specified a standardized approach and an internal models 
approach. Even when they have been approved by their supervisors to use the inter- 
nal models approach, banks must also implement the standardized approach. Regulatory 
capital under the standardized approach is based on formulas involving the delta, vega, 
and gamma exposures of the trading book. Regulatory capital under the internal mod- 
els approach is based on the calculation of stressed expected shortfall. Calculations are 
carried out separately for each trading desk. 


Further Reading 


Bank for International Settlements. “Minimum Capital Requirements for Market Risk,” January 2016. 


Practice Questions and Problems 
(Answers at End of Book) 


18.1 Outline the differences between the way market risk capital is calculated in (a) 
Basel I, (b) Basel II.5, and (c) the FRTB. 
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18.2 


18.3 


18.4 
18.5 


18.6 


18.7 


Use footnote 4 to verify that when losses have a normal distribution with mean p 
and standard deviation o the 97.5% expected shortfall is p + 2.3380. 

Explain why the use of overlapping time periods proposed by the FRTB does not 
give independent observations on the changes in variables. 

What are the advantages of expected shortfall over value at risk? 

What is the difference between the trading book and the banking book? Why are 
regulators concerned about specifying whether an instrument should be one or the 
other in the FRTB? 

How are credit trades handled under the FRTB? 


Further Question 


Suppose that an investor owns the $10 million portfolio in Table 13.1 on September 
30, 2014. The values of the DJIA, FTSE 100, CAC 40, and Nikkei 225 on that day 
were 17,042.90, 6,622.7, 4,416.24, and 16,173.52. The exchange rates on that day 
were 1.6211 USD per GBP,0.7917 EUR per USD, and 109.64 JPY per USD. Sup- 
pose that the 250 days ending September 9, 2008, constitute the stressed period for 
the portfolio. Calculate the 97.5% expected shortfall using the overlapping periods 
method in conjunction with historical simulation and the cascade approach. Rele- 
vant data on the indices is on the author’s website (see “Worksheets for Value at Risk 
Example”). For the purposes of this problem, assume that DJIA and FTSE have a 
10-day liquidity horizon, CAC 40 has a 40-day liquidity horizon, and Nikkei 225 
has a 20-day liquidity horizon. For each day during the stressed period, consider 
the change in a variable over a 10-day period ending on the day. 


Part Four 


CREDIT RISK 


Chapter 19 


Estimating Default 
Probabilities 


s explained in Chapter 15, regulators have for a long time required banks to 

keep capital for credit risk. Under Basel II banks can, with approval from bank 

supervisors, use their own estimates of default probabilities to determine the 
amount of capital they are required to keep. This has led banks to search for better ways 
of estimating these probabilities. 

In this chapter, we discuss a number of different approaches to estimating default 
probabilities and explain the key difference between risk-neutral and real-world esti- 
mates. The material we cover will be used in Chapter 20 when we examine how the 
price of a derivative in the bilaterally cleared over-the-counter market can be adjusted 
for counterparty credit risk, and in Chapter 21 when we discuss the calculation of credit 
value at risk. 


19.1 Credit Ratings 


As explained in Section 1.7, rating agencies such as Moody’s, S&P, and Fitch provide 
ratings describing the creditworthiness of corporate bonds.’ Because a credit rating is 


'In theory, a credit rating is an attribute of a bond issue, not a company. However, in many cases all 
bonds issued by a company have the same rating. A rating is therefore often referred to as an attribute 
of a company. 
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designed to provide information about credit quality, one might expect frequent changes 
in credit ratings as positive and negative information reaches the market. In fact, ratings 
change relatively infrequently. One of the objectives of rating agencies when they assign 
ratings is stability. For example, they want to avoid ratings reversals, where a company’s 
bonds are downgraded and then upgraded a few weeks later. Ratings therefore change 
only when there is reason to believe that a long-term change in the company’s credit- 
worthiness has taken place. The reason for this is that bond traders are major users of 
ratings. Often they are subject to rules governing what the credit ratings of the bonds 
they hold must be. (For example, many bond funds are allowed to hold only investment- 
grade bonds.) If these ratings changed frequently, they might have to do a large amount 
of trading (and incur high transaction costs) just to satisfy the rules. 

A related point is that rating agencies try to “rate through the cycle.’ Suppose that 
the economy exhibits a downturn and this has the effect of increasing the probability 
of a company defaulting in the next six months, but makes very little difference to the 
company’s probability of defaulting over the next three to five years. A rating agency 
would not usually change the company’s credit rating in these circumstances. 

Companies such as Moody’s KMV and Kamakura provide estimates of a company’s 
probability of default that are based on its equity price and other variables. These estimates 
do not have stability as one of their objectives and tend to respond more quickly to 
market information than credit ratings. The types of models that are used to produce the 
estimates will be discussed in Section 19.8. 


19.1.1 Internal Credit Ratings 


Most banks have procedures for rating the creditworthiness of their corporate and retail 
clients. This is a necessity. The ratings published by rating agencies are usually available 
only for companies that have issued publicly traded debt. As a result, many small and 
medium-sized companies do not have credit ratings from rating agencies. As explained 
in Chapter 15, the internal-ratings-based (IRB) approach in Basel II allows banks to use 
their internal ratings in determining the probability of default, PD. 

Internal-ratings-based approaches for estimating PD typically involve profitability 
ratios such as return on assets and balance sheet ratios such as current assets divided by 
current liabilities (the current ratio) and debt to equity. Banks recognize that it is cash 
rather than profits that is necessary to repay a loan. They typically take the financial 
information provided by a company and convert it to a cash flow statement. This allows 
them to estimate how easy it will be for a company to service its debt. 


19.1.2 Altman’s Z-Score 


Edward Altman has pioneered the use of accounting ratios to predict default. In 1968, 
he developed what has become known as the Z-score? Using a statistical technique 


2See E. I. Altman, “Financial Ratios, Discriminant Analysis, and the Prediction of Corporate 
Bankruptcy,’ Journal of Finance 23, no. 4 (September 1968): 589-609. 
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known as discriminant analysis, he attempted to predict defaults from five accounting 
ratios: 


X,: Working capital/Total assets 

X,: Retained earnings/Total assets 

X;: Earnings before interest and taxes/Total assets 

X,: Market value of equity/Book value of total liabilities 
X;: Sales/ Total assets 


For publicly traded manufacturing companies, the original Z-score was 
Z = 1.2X, + 1.4X, + 3.3X, + 0.6X, + 0.999%, (19.1) 


If the Z-score was greater than 3.0, the company was considered unlikely to default. If it 
was between 2.7 and 3.0, there was reason to be “on alert.” Ifit was between 1.8 and 2.7, 
there was a good chance of default. If it was less than 1.8, the probability of a financial 
embarrassment was considered to be very high. Equation (19.1) was estimated from a 
sample of 66 publicly traded manufacturing companies. Of these, 33 failed within one 
year and 33 did not fail within one year. The model proved to be fairly accurate when 
tested out of sample (i.e., on a set of firms different from that used to estimate equation 
19.1). Both Type I errors (companies that were predicted not to go bankrupt but did do 
so) and Type II errors (companies that were predicted to go bankrupt, but did not do so) 
were small? Variations on the model have been developed for manufacturing companies 
that are not publicly traded and for non-manufacturing companies. 


Example 19.1 


Consider a company for which working capital is 170,000, total assets are 670,000, 
earnings before interest and taxes is 60,000, sales are 2,200,000, the market value of 
equity is 380,000, total liabilities is 240,000, and retained earnings is 300,000. In this 
case, X, = 0.254, X, = 0.448, X, = 0.0896, X, = 1.583, and X, = 3.284. The Z-score 
is 


1.2 X 0.254 + 1.4 x 0.448 + 3.3 x 0.0896 + 0.6 x 1.583 + 0.999 x 3.284 = 5.46 


The Z-score indicates that the company is not in danger of defaulting in the near future. 


The Z-score methodology has been revised and extended since Altman’s original 
research more than 45 years ago and can now be used to produce probabilities of default 
for applications such as Basel II. 


3 Type I errors are liable to be more costly to the lending department of a commercial bank than Type 
II errors. 
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19.2 Historical Default Probabilities 


Table 19.1 is typical of the data that are produced by rating agencies. It shows the default 
experience through time of companies that started with a certain credit rating. For exam- 
ple, Table 19.1 shows that a bond with an initial Moody’s credit rating of Baa has a 
0.177% chance of defaulting by the end of the first year, a 0.461% chance of defaulting 
by the end of the second year, and so on. The probability of a bond defaulting during 
a particular year can be calculated from the table. For example, the probability that a 
bond initially rated Baa will default during the second year of its life is 0.461 — 0.177 = 
0.284%. 

Table 19.1 shows that, for investment-grade bonds, the probability of default in a 
year tends to be an increasing function of time. (For example, the probabilities of a bond 
initially rated Aa defaulting during years one, two, three, four, and five are 0.021%, 0.039%, 
0.050%, 0.082%, and 0.106%, respectively.) This is because the bond issuer is initially 
considered to be creditworthy and the more time that elapses, the greater the possibility 
that its financial health will decline. For bonds with a poor credit rating, the probability of 
default is often a decreasing function of time. (For example, the probabilities that a bond 
initially in the lowest (Caa-C) category will default during years one, two, three, four, and 
five are 10.624%, 8.046%, 6.773%, 5.531%, and 4.569%, respectively.) The reason here 
is that, for a bond with a poor credit rating, the next year or two may be critical. If the 
issuer survives this period, its financial health is likely to have improved. 


19.2.1 Hazard Rates 


From Table 19.1, we can calculate the probability of a Caa-C bond defaulting during 
the third year as 25.443 — 18.670 = 6.773%. We will refer to this as the unconditional 
default probability. It is the probability of default during the third year as seen at time 
zero. The probability that the Caa-rated bond will survive until the end of year two 
is 100 — 18.670 = 81.330%. The probability that it will default during the third year 
conditional on no earlier default is therefore 0.06773/0.81330 or 8.33%. 

The 8.33% we have just calculated is a conditional default probability for a one- 
year time period. When we consider a conditional default probability for a short time 


Table 19.1 Average Cumulative Issuer-Weighted Default Rates (%), 1970-2016 


Time (years) 1 2 3 4 5 7 10 15 20 

Aaa 0.000 0.011 0.011 0.031 0.085 0.195 0.386 0.705 0.824 
Aa 0.021 0.060 0.110 0.192 0.298 0.525 0.778 1.336 2.151 
A 0.055 0.165 0.345 0.536 0.766 1.297 2.224 3.876 5.793 
Baa 0.177 0.461 0.804 1.216 1.628 2.472 3.925 7.006 10.236 
Ba 0.945 2.583 4.492 6.518 8.392 11.667 16.283 23.576 29.733 
B 3.573 8.436 13.377 17.828 21.908 28.857 36.177 43.658 48.644 
Caa-C 10.624 18.670 25.443 30.974 35.543 42.132 50.258 53.377 53.930 


Source: Moody’. 
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period of length At, we get a measure known as the hazard rate or default intensity. The 
hazard rate, A(t), at time t is defined so that A(t)At is the probability of default between 
time t and t + At conditional on no default between time zero and time t. If V (À is the 
cumulative probability of the company surviving to time t (i.e., no default by time f), 
the unconditional default probability between times t and t+ At is [V (ft) — V(t + Ad]. 
The probability of default between times t and t + At conditional on no earlier default 
is [V(t) — V(t + At)|/ V(t). Hence 


Vi) -Vitt Ad) 


Z0 = M(NAt 
or 
Viet At) — V(t) 
ie = -MAV (À 
Taking limits 
dV(t) _ E 
Pa = X(t) V(t) 


from which 
V(t) =e h Mon 
Defining Q(t) as the probability of default by time t, so that Q(t) = 1 — V(t) gives 
Q(t) =1-—e i Mt)de 


or 


QA) = 1 = (19.2) 
where NG) is the average hazard rate between time zero and time t. 


Example 19.2 


Suppose that the hazard rate is a constant 1.5% per year. The probability of a default 
by the end of the first year is 1 — e~°°!°*! = 0,0149. The probability of a default by 
the end of the second year is 1 — e~°0>*? = 0.0296. The probabilities of a default by 
the end of the third, fourth, and fifth years are similarly 0.0440, 0.0582, and 0.0723. 
The unconditional probability of a default during the fourth year is 0.0582 — 0.0440 = 
0.0142. The probability of default in the fourth year, conditional on no earlier default, is 
0.0142/(1 — 0.0440) = 0.0149. 
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Table 19.2 Recovery Rates on Corporate Bonds as a Percent 
of Face Value, 1983 to 2016, Issuer Weighted 


Average 
Class Recovery Rate (%) 
1st lien bond 52.8 
2nd lien bond 44.6 
Senior unsecured bond 37.2 
Senior subordinated bond 31.1 
Subordinated bond 31.9 
Junior subordinated bond 23.2 


Source: Moody’. 
19.3 Recovery Rates 


When a company goes bankrupt, those who are owed money by the company file claims 
against the company.* Sometimes there is a reorganization in which these creditors agree 
to a partial payment of their claims. In other cases, the assets are sold by the liquidator 
and the proceeds are used to meet the claims as far as possible. Some claims typically have 
priorities over other claims and are met more fully. 

The recovery rate for a bond is normally defined as the price at which it trades about 
one month after default as a percent of its face value. As we saw in Chapter 15, the 
Basel II formulas are expressed in terms of the loss given default (LGD). The percentage 
recovery rate is 100 minus the percentage loss given default. 

Table 19.2 provides historical data on average recovery rates for different categories 
of bonds in the United States. This varies from 52.8% for those who have a first lien on 
assets to 23.2% for those who rank after other lenders. 


19.3.1 The Dependence of Recovery Rates on Default Rates 


In Chapter 6, we saw that one of the lessons from the credit crisis of 2007—2008 is that 
the average recovery rate on mortgages is negatively related to the mortgage default rate. 
As the mortgage default rate increases, foreclosures lead to more houses being offered for 
sale and a decline in house prices. This in turn results in a decline in recovery rates. 
The average recovery rate on corporate bonds exhibits a similar negative dependence 
on default rates? In a year when the number of bonds defaulting is low, economic condi- 
tions are usually good and the average recovery rate on those bonds that do default might 
be as high as 60%; in a year when the default rate on corporate bonds is high, economic 
conditions are usually poor and the average recovery rate on the defaulting bonds might 


In the United States, the claim made by a bondholder is the bond’s face value plus accrued interest. 
>See E. I. Altman, B. Brady, A. Resti, and A. Sironi, “The Link between Default and Recovery Rates: 
Theory, Empirical Evidence, and Implications,” Journal of Business (November 2005): 2203-2228. The 
correlation is also discussed in publications by Moody’s Investors Service. It finds that the average 
recovery rate in a year and the non-investment-grade default rate are highly correlated. 
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be as low as 30%. The result of the negative dependence is that a bad year for defaults is 
doubly bad for a lender because it is usually accompanied by a low recovery rate. 


19.4 Credit Default Swaps 


A credit default swap (CDS) is a derivative that allows market partipants to trade credit 
risks. As indicated in Business Snapshot 19.1, the market for this product grew rapidly 
until 2007 and then declined. The simplest type of CDS is an instrument that provides 
insurance against the risk of a default by a particular company. The company is known 
as the reference entity and a default by the company is known as a credit event. The buyer of 
the insurance obtains the right to sell bonds issued by the company for their face value 
when a credit event occurs and the seller of the insurance agrees to buy the bonds for 
their face value when a credit event occurs® The total face value of the bonds that can 
be sold is known as the credit default swap’s notional principal. 

The buyer of a CDS makes periodic payments to the seller until the end of the life 
of the CDS or until a credit event occurs. These payments are usually made in arrears 
every quarter. 

An example will help to illustrate how a typical deal is structured. Suppose that 
two parties enter into a five-year credit default swap on December 20, 2018. Assume 
that the notional principal is $100 million and the buyer agrees to pay 90 basis 
points per year (quarterly in arrears) for protection against default by the reference 
entity. 

The CDS is shown in Figure 19.1. If the reference entity does not default (that is, 
there is no credit event), the buyer receives no payoff and pays approximately $225,000 
(= 0.25 X 0.0090 x 100,000,000) on March 20, June 20, September 20, and December 
20 of each of the years 2019, 2020, 2021, 2022, and 2023.’ If there is a credit event, 
a substantial payoff is likely. Suppose that the buyer notifies the seller of a credit event 
on May 20, 2021 (five months into the third year). If the contract specifies physical 
settlement, the buyer of protection has the right to sell to the seller of protection bonds 
issued by the reference entity with a face value of $100 million for $100 million. If, as 
is now usual, there is a cash settlement, a two-stage auction process is used to determine 
the mid-market value of the cheapest deliverable bond several days after the credit event. 
Suppose the auction indicates that the cheapest deliverable bond is worth $35 per $100 
of face value. The cash payoff would be $65 million. 

The regular payments from the buyer of protection to the seller of protection cease 
when there is a credit event. However, because these payments are made in arrears, a 
final accrual payment by the buyer is usually required. In our example, where there is a 
default on May 20, 2021, the buyer would be required to pay to the seller the amount of 


é The face value (or par value) of a bond is the principal amount that the issuer will repay at maturity 
if it does not default. 
7 The payments are not exactly $225,000 because of the impact of day count conventions. 
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BUSINESS SNAPSHOT 19.1 
The CDS Market 


In 1998 and 1999, the International Swaps and Derivatives Association devel- 
oped a standard contract for trading credit default swaps in the over-the-counter 
market. After that, the market grew very fast. The Bank for International Set- 
tlements (BIS) started producing statistics for the size of the credit derivatives 
market in December 2004. At that time, the total notional principal underlying 
outstanding contracts was estimated to be about $6 trillion. It peaked at $58 tril- 
lion in December 2007. But it had fallen to about $10 trillion by December 2016. 
Banks and other financial institutions are both buyers and sellers of protection. 
Banks tend to be net buyers of protection, and insurance companies tend to be 
net sellers of protection. Banks sometimes use credit default swaps to hedge an 
exposure to a borrower. It is then the case that the financial institution bearing 
the credit risk of a loan is different from the financial institution that did the 
original credit checks. 

During the credit turmoil that started in August 2007, regulators became 
concerned that CDSs were a source of systemic risk. (See Business Snapshot 15.1 
for a discussion of systemic risk.) No doubt their concerns arose in part because 
of the losses experienced by the insurance company AIG. This was a big seller of 
protection on the AAA-rated tranches created from mortgages (see Chapter 6). 
The protection proved very costly to AIG and a failure of AIG would have led 
to big losses elsewhere in the financial system. AIG was bailed out by the United 
States government in September 2008. 

CDSs have come under criticism during the European sovereign debt crisis. 
Some legislators feel that speculative activity in credit default swap markets has 
exacerbated the debt problems of countries such as Greece; naked CDS positions 
on sovereign debt (where credit protection on a country is bought without an 
underlying exposure) were banned in Europe in 2013. 

During 2007 and 2008, trading ceased in many types of credit derivatives, 
but plain vanilla CDSs, which provide protection against a single company or 
country defaulting, continued to trade actively (albeit with dramatically increased 
spreads). This is because their structure is easy to understand compared with, 
say, ABS CDOs (see Chapter 6). However, as pointed out in Business Snapshot 
19.2, flaws in this structure are emerging. This may account for their declining 
popularity. 

There was a huge number of CDS contracts outstanding with Lehman 
Brothers as the reference entity when Lehman Brothers declared bankruptcy 
in September 2008. The recovery rate (determined by an auction process) was 
only about eight cents on the dollar, so that the payout to the buyers of protec- 
tion was equal to about 92% of the notional principal. There were predictions 
that some sellers of protection would be unable to pay and that further bankrupt- 
cies would occur, but on the settlement day (October 21, 2008) everything went 
smoothly. 
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Default 90 basis points per year Default 
protection protection 
buyer Payment if default by seller 


reference entity 


Figure 19.1 Credit Default Swap 


the annual payment accrued between March 20, 2021, and May 20, 2021 (approximately 
$150,000), but no further payments would be required. 

The total amount paid per year, as a percent of the notional principal, to buy pro- 
tection is known as the CDS spread. (In our example, the CDS spread is 90 basis points.) 
Several large banks are market makers in the credit default swap market. For a five-year 
credit default swap on a company, a market maker might quote: bid 250 basis points, offer 
260 basis points. This means that the market maker is prepared to buy protection by pay- 
ing 250 basis points per year (i.e., 2.5% of the principal per year) and to sell protection 
for 260 basis points per year (i.e., 2.6% of the principal per year). 

Many different companies and countries are reference entities for the CDS con- 
tracts traded. Contracts with maturities of five years are most popular, but other matu- 
rities such as 1, 2, 3, 7, and 10 years are also traded. Usually, contracts mature on the 
following standard dates: March 20, June 20, September 20, and December 20. The 
effect of this is that the actual time to maturity of a contract when it is initiated is 
close to, but not necessarily the same as, the number of years to maturity that is speci- 
fied. Suppose you call a dealer on November 15, 2018, to buy five-year protection on a 
reference entity. The contract would probably last until December 20, 2023. Your first 
payment would be due on December 20, 2018, and would equal an amount covering the 
November 15,2018, to December 20, 2018, periods After that, payments would be made 
quarterly. 

A key aspect of a CDS contract is the definition of a credit event (i.e., a default). 
Usually a credit event is defined as a failure to make a payment as it becomes due, a 
restructuring of debt, or a bankruptcy. Restructuring is sometimes excluded in North 
American contracts, particularly in situations where the yield on the company’s debt is 
high. Potential flaws in the CDS market are discussed in Business Snapshot 19.2. They 
may account for the market’s declining popularity. 


19.4.1 The Cheapest-to-Deliver Bond 


Usually a CDS specifies that a number of different bonds can be delivered in the event of 
a default. The bonds typically have the same seniority, but they may not sell for the same 


8 Ifthe time to the first date is less than one month, the first payment is typically on the second payment 
date, otherwise it is on the first payment date. 
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BUSINESS SNAPSHOT 19.2 
Is a CDS a Flawed Contract? 


There is one important difference between credit default swaps and most other 
over-the-counter derivatives. The other over-the-counter derivatives depend on 
interest rates, exchange rates, equity indices, commodity prices, and so on. There 
is NO reason to assume that any one market participant has better information 
than other market participants about these variables. 

Credit default swap spreads depend on the probability that a particular com- 
pany will default during a particular period of time. Arguably, some market 
participants have more information to estimate this probability than others. A 
financial institution that works closely with a particular company by providing 
advice, making loans, and handling new issues of securities is likely to have more 
information about the creditworthiness of the company than another financial 
institution that has no dealings with the company. Economists refer to this as an 
asymmetric information problem. 

A blatant abuse of the CDS market was reported in the Financial Times on 
January 11, 2018. A US. housebuilder, Hovnanian, obtained attractive funding 
from the hedge fund GSO Capital Partners with the agreement that it would 
skip an interest payment, thereby manufacturing a technical default. GSO had 
bought protection on Hovnanian and obtained a big payoff. Assuming that this 
is a legal arrangement, it may lead to CDSs, as currently structured, disappearing 
altogether. 


percentage of face value immediately after a default? This gives the holder of a CDS what 
is known as a cheapest-to-deliver bond option. As already mentioned, an auction process is 
usually used to determine the value of the cheapest-to-deliver bond and, therefore, the 
payoff to the buyer of protection. 

The determination of CDS spreads and the valuation of CDS transactions is discussed 
in Appendix K. 


19.4.2 Credit Indices 


Participants in credit markets have developed indices to track credit default swap 
spreads. In 2004, there were agreements between different producers of indices that 


? There are a number of reasons for this. The claim that is made in the event of a default is typically 
equal to the bond’s face value plus accrued interest. Bonds with high accrued interest at the time of 
default therefore tend to have higher prices immediately after default. Also, the market may judge that 
in the event of a reorganization of the company some bondholders will fare better than others. 
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led to some consolidation. Two important standard portfolios used by index providers 
are: 


1. CDX NA IG, a portfolio of 125 investment-grade companies in North America 
2. iTraxx Europe, a portfolio of 125 investment-grade companies in Europe 


These portfolios are updated on March 20 and September 20 each year. Companies that 
are no longer investment grade are dropped from the portfolios and new investment- 
grade companies are added.!” 

Suppose that the five-year CDX NA IG index is quoted by a market maker as bid 165 
basis points, offer 166 basis points. The quotes mean that a trader can buy CDS protection 
on all 125 companies in the index for 166 basis points per company. Suppose an investor 
wants $800,000 of protection on each company. The total cost is 0.0166 x 800,000 x 125 
or $1,660,000 per year. (The investor could similarly sell $800,000 of protection on 
each of the 125 companies for a total of $1,650,000 per annum.) When a company 
defaults, the investor receives the usual CDS payoff and the annual payment is reduced 
by 1,660,000/125 = $13,280. There is an active market in buying and selling CDS index 
protection for maturities of 3,5,7,and 10 years. The maturities for these types of contracts 
on the index are usually December 20 and June 20. (This means that a “five-year” contract 
lasts between 42 and F years.) Roughly speaking, the index is the average of the CDS 
spreads on the companies in the underlying portfolio.!! 


19.4.3 The Use of Fixed Coupons 


To facilitate trading, the precise way in which CDS and CDS index transactions work 
is a little more complicated than has been described up to now. In practice, CDS and 
CDS indices trade like bonds. For each of the standard transactions that trade, a fixed 
coupon and a recovery rate are specified. The periodic protection payments are equal to 
the coupon. The CDS spread is quoted in the market. When there is a trade, the present 
value of the difference between payments equal to the spread and payments equal to the 
coupon for the remaining life of the transaction is estimated using a specified procedure. 
If the quoted spread is less than the coupon, the seller of protection pays this present value 


10 On September 20, 2014, the Series 22 iTraxx Europe portfolio and the Series 23 CDX NA IG 
portfolio were defined. The series numbers indicate that by the end of September 2014 the iTraxx 
Europe portfolio had been updated 21 times and the CDX NA IG portfolio had been updated 22 
times. 

11 More precisely, the index is slightly lower than the average of the credit default swap spreads for the 
companies in the portfolio. To understand the reason for this, consider a portfolio consisting of two 
companies, one with a spread of 1,000 basis points and the other with a spread of 10 basis points. To 
buy protection on the companies would cost slightly less than 505 basis points per company. This is 
because the 1,000 basis points is not expected to be paid for as long as the 10 basis points and should 
therefore carry less weight. Another complication for CDX NA IG, but not iTraxx Europe, is that the 
definition of default applicable to the index includes restructuring whereas the definition for CDS 
contracts on the underlying companies may not. 
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to the buyer of protection at the time of the trade. If the quoted spread is greater than 
the coupon, the buyer of protection pays this present value to the seller of protection at 
the time of the trade. After that, the buyer of protection then pays the coupon times the 
principal applicable to those companies in the index that have not yet defaulted to the 
seller. 


19.5 Credit Spreads 


The credit spread is the extra rate of interest per annum required by investors for bearing a 
particular credit risk. CDS spreads, which were explained in the previous section, provide 
one measure of the credit spread. Another is the bond yield spread. This is the amount 
by which the yield on a corporate bond exceeds the yield on a similar risk-free bond. 
We now show that the two should be approximately equal. 


19.5.1 CDS Spreads and Bond Yields 


A CDS can be used to hedge a position in a corporate bond. Suppose that an investor buys 
a five-year corporate bond yielding 7% per year for its face value and at the same time 
enters into a five-year CDS to buy protection against the issuer of the bond defaulting. 
Suppose that the CDS spread is 200 basis points or 2% per annum. The effect of the CDS 
is to convert the corporate bond to a risk-free bond (at least approximately). If the bond 
issuer does not default, the investor earns 5% per year (when the CDS spread is netted 
against the corporate bond yield). If the bond issuer does default, the investor earns 5% 
up to the time of the default. Under the terms of the CDS, the investor is then able to 
exchange the bond for its face value. This face value can be invested at the risk-free rate 
for the remainder of the five years. 

This argument shows that the n-year CDS spread should be approximately equal 
to the excess of the par yield on an n-year corporate bond over the par yield on an 
n-year risk-free bond.’” If it is markedly less than this, an investor can earn more than 
the risk-free rate by buying the corporate bond and buying protection. If it is markedly 
greater than this, an investor can borrow at less than the risk-free rate by shorting the 
corporate bond and selling CDS protection. As will be discussed later, these are not 
perfect arbitrages, and the extent to which they can be used is in practice influenced by 
liquidity constraints and other considerations. 


19.5.2 The Risk-Free Rate 


CDSs provide a direct estimate of the credit spread. To calculate a credit spread from a 
bond yield, it is necessary to make an assumption about the risk-free rate. When bond 
yield spreads are quoted by bond traders, the risk-free rate that is used is usually the yield 


The par yield on an n-year bond is the coupon rate per year that causes the bond to sell for its par 
value (i.e., its face value). 
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on a Treasury bond of similar maturity. For example, a bond trader might quote the yield 
on a particular corporate bond as being a spread of 250 basis points over Treasuries. 

A number of researchers have compared bond yields to CDS spreads to imply a 
risk-free rate. This involves matching the maturities of CDSs and bonds and implying 
a risk-free rate from the arbitrage arguments given above. For example, if the five-year 
bond yield is 4.7% and the five-year CDS spread is 80 basis points, the implied five-year 
risk-free rate is 3.9%. 

As discussed in Section 9.2, traders have traditionally used LIBOR/swap rates as 
proxies for risk-free rates when valuing derivatives. The research indicates that this prac- 
tice has carried over to the credit market. Implied risk-free rates are much closer to the 
LIBOR/swap rates than to the Treasury rates. One estimate puts implied risk-free rates 
at about 10 basis points less than the LIBOR/swap rate.'> This estimate is plausible. As 
explained in Section 9.2, the credit risk in a swap rate is the credit risk from making a 
series of short-term loans to AA-rated counterparties and 10 basis points is a reasonable 
credit spread for a short-term AA-rated instrument. 


19.5.3 Asset Swaps 


Asset swaps provide a convenient reference point for traders in credit markets because 
they give direct estimates of the excess of bond yields over LIBOR/swap rates. 

To explain how asset swaps work, consider the situation where an asset swap spread 
for a particular bond is quoted as 150 basis points. There are three possible situations: 


1. The bond sells for its par value of 100. The swap then involves one side (Company 
A) paying the coupon on the bond and the other side (Company B) paying LIBOR 
plus 150 basis points.!* 

2. The bond sells below its par value, say, for 95. The swap is then structured so that 
Company A pays $5 per $100 of notional principal at the outset. After that, Company 
A pays the bond’s coupons and Company B pays LIBOR plus 150 basis points. 

3. The underlying bond sells above par, say, for 108. The swap is then structured so that 
Company B makes a payment of $8 per $100 of principal at the outset. After that, 
Company A pays the bond’s coupons and Company B pays LIBOR plus 150 basis 
points. 


The effect of structuring asset swaps in this way is that the present value of the asset 
swap spread (150 basis points in our example) is the amount by which the price of the 
corporate bond is exceeded by the price of a similar risk-free bond where the risk-free 
rate is assumed to be given by the LIBOR/swap curve (see Problem 19.16). 


'3See J. Hull, M. Predescu, and A. White, “The Relationship between Credit Default Swap Spreads, 
Bond Yields, and Credit Rating Announcements,” Journal of Banking and Finance 28 (November 2004): 
2789-2811. 

'4Note that it is the promised coupons that are exchanged. The exchanges take place regardless of 
whether the bond defaults. 
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19.5.4 CDS—Bond Basis 


The CDS-—bond basis is the excess of the CDS spread over the bond yield spread for a 
company. 


CDS-—Bond Basis = CDS Spread — Bond Yield Spread 


The bond yield spread is calculated relative to the LIBOR/swap benchmark. Usually it 
is assumed to be the asset swap spread. 

The arbitrage argument given above, relating CDS spreads and bond yields, suggests 
that the CDS—bond basis should be close to zero. In fact, there are a number of reasons 
why it deviates from zero. For example: 


1. The bond may sell for a price that is significantly different from par. (Bond prices 
above par tend to give rise to a negative basis; bond prices below par tend to give 
rise to a positive basis.) 

2. There is counterparty default risk in a CDS. (This pushes the basis in a negative 
direction.) 

3. There is a cheapest-to-deliver bond option in a CDS. (This pushes the basis in a 
positive direction.) 

4. The payoff in a CDS does not include accrued interest on the bond that is delivered. 
(This pushes the basis in a negative direction.) 

5. The restructuring clause in a CDS contract may lead to a payoff when there is no 
default. (This pushes the basis in a positive direction.) 

6. LIBOR is greater than the risk-free rate being assumed by the market. (This pushes 
the basis in a positive direction.) 


Prior to the market turmoil starting in 2007, the basis tended to be positive. For 
example, De Witt estimates that the average CDS bond basis in 2004 and 2005 was 16 
basis points.!° During the credit crisis, the basis was at times very negative but, as explained 
by Bai and Collin-Dufresne (2011), it was difficult for financial institutions to arbitrage 
between bonds and CDSs because of a shortage of liquidity and other considerations.’® 
Since the crisis, the magnitude of the CDS-bond basis (sometimes positive and sometimes 


negative) has become much smaller. 


19.6 Estimating Default Probabilities from Credit Spreads 


We now discuss how default probabilities can be estimated from credit spreads. 


15See J. De Witt, “Exploring the CDS—Bond Basis,” Working Paper no. 104, National Bank of Belgium, 
2006. 
See J. Bai and P. Collin-Dufresne, “The CDS—Bond Basis,” Working Paper, SSRN 2024531, 2011. 
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19.6.1 Approximate Calculation 


Suppose that a five-year credit spread (CDS spread, bond yield spread, or asset swap 
spread) for a company is 240 basis points and that the expected recovery rate in the 
event of a default is 40%. The holder of a corporate bond issued by the company must 
be expecting to lose 240 basis points (or 2.4% per year) from defaults. Roughly speak- 
ing, the credit spread can be considered to be an average loss rate. Given the recovery 
rate of 40%, this leads to an estimate of the average probability of a default per year 
over the five-year period, conditional on no earlier default, of 0.024/(1 — 0.4) or 4%. In 
general: 


(19.3) 


where s(T) is the credit spread (which should in theory be expressed with continuous 
compounding) for a maturity of T, R is the recovery rate, and i is the average hazard 
rate between time zero and time T. 

If credit spreads are known for a number of different maturities, the term structure 
of the hazard rate can be bootstrapped (at least approximately), as the following example 
illustrates. 


Example 19.3 


Suppose that the CDS spreads for 3-, 5-, and 10-year instruments are 50,60, and 100 basis 
points, respectively, and the expected recovery rate 1s 60%. The average hazard rate over 
three years is approximately 0.005/(1 — 0.6) = 0.0125. The average hazard rate over five 
years is approximately 0.006/(1 — 0.6) = 0.015. The average hazard rate over 10 years is 
approximately 0.01/(1 — 0.6) = 0.025. From this we can estimate that the average hazard 
rate between year 3 and year 5 is (5 X 0.015 — 3 x 0.0125) /2 = 0.01875. The average 
hazard rate between year 5 and year 10 is (10 x 0.025 — 5 x 0.015)/5 = 0.035. 


19.6.2 A More Exact Calculation 


The calculation we have just given works well for CDS spreads. It also works well for 
bond yield spreads and asset swap spreads when the underlying bond is selling for close 
to its par value. We now consider a more exact calculation for situations when the under- 
lying bond’s price is not close to par. 

Suppose that a five-year corporate bond with a principal of 100 provides a coupon 
of 6% per annum (paid semiannually) and that the yield on the bond is 7% per annum 
(with continuous compounding). The yield on a similar risk-free bond is 5% (again with 
continuous compounding). The yields imply that the price of the corporate bond is 
95.34 and the price of the risk-free bond is 104.09. The expected loss from default over 
the five-year life of the bond is therefore 104.09 — 95.34, or $8.75. For simplicity, we 
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Table 19.3 Calculation of Loss from Default on a Bond in Terms of the Default Probabilities per 
Year, Q 


Time Def. Recovery Default-Free Discount PV of Expected 
(yrs) Prob. Amount ($) Value ($) Loss ($) Factor Loss ($) 
0.5 Q 40 106.73 66.73 0.9753 65.08Q 
1.5 Q 40 105.97 65.97 0.9277 61.20Q 
2.5 Q 40 105.17 65.17 0.8825 57.52Q 
3.5 Q 40 104.34 64.34 0.8395 54.01Q 
4.5 Q 40 103.46 63.46 0.7985 50.67Q 
Total 288.48Q 


Notional Principal = $100. 


suppose that the unconditional probability of default per year is the same each year and 
equal to Q. Furthermore, we assume defaults can happen only at times 0.5, 1.5, 2.5, 3.5, 
and 4.5 years (immediately before coupon payment dates). Risk-free rates are assumed 
to be 5% (with continuous compounding) for all maturities and the recovery rate (i.e., 
recovery as a percent of principal) is assumed to be 40%. (The analysis can be extended 
so that defaults happen more frequently.) 

Table 19.3 calculates the expected loss from defaults in terms of Q. To illustrate the 
calculations, consider the 3.5-year row in Table 19.3. The expected value of the default- 
free bond at time 3.5 years (calculated using the forward risk-free interest rates) is 


3 + 3,e—0-05x0.5 + 3e—9-05x1.0 + 103¢70:09x15 = 104.34 


Given the definition of recovery rates in Section 19.3, the amount recovered if there is 
a default is 40 so that the loss given default is 104.34 — 40 or $64.34. The present value 
of this loss is 54.01 and the expected loss is therefore 54.01Q. 

Table 19.3 shows that the total expected loss is 288.48 Q. Setting this equal to the 
8.75 expected loss calculated earlier, we obtain a value for Q of 8.75/288.48, or 3.03%. 
The calculations we have given assume that the default probability is the same in each 
year and that defaults take place at just one time during the year. We can extend the 
calculations to assume that defaults take place more frequently. Also, instead of assuming 
a constant unconditional probability of default, we can assume a constant hazard rate or 
assume a particular pattern for the variation of the default probability with time. With 
several bonds, we can estimate several parameters describing the term structure of default 
probabilities. Suppose, for example, that we have bonds maturing in 3,5, 7, and 10 years 
and we assume a step function for the default probability. We could use the first bond 
to estimate the default probability per year for the first three years, the second bond to 
estimate the default probability per year for years 4 and 5, the third bond to estimate the 
default probability per year for years 6 and 7, and the fourth bond to estimate the default 
probability per year for years 8, 9, and 10 (see Problems 19.15 and 19.24). This approach 
is analogous to the bootstrap procedure for estimating the term structure of interest rates 
in Appendix B. 
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Table 19.4 Cumulative Default Probabilities Compared with Credit Spreads 


Cumulative 7-Year Default 7-Year Credit Spread (bp) 
Rating Probabilities (%), 1970-2016 1996-2007 
Aaa 0.195 35.74 
Aa 0.525 43.67 
A 1.297 68.68 
Baa 2.472 127.53 
Ba 11.667 280.28 
B 28.857 481.04 
Caa 42.132 1,103.70 


19.7 Comparison of Default Probability Estimates 


We now compare the default probability estimates calculated from historical data with 
those calculated from credit spreads. Table 19.4 shows: 


The seven-year average cumulative probability of default for companies with different 
credit ratings. This is taken from data published by Moody’s. It is the seven-year column 
of Table 19.1. 

The average credit spread for bonds with different credit ratings between December 
1996 and June 2007. This is taken from data on bond yields published by Merrill 
Lynch and an assumption that the risk-free rate is 10 basis points less than the seven- 


year swap rate. (See discussion of risk-free rate proxy in Section 19.5.2.) The bonds 
had an average maturity of about seven years. 


The table considers data on bond yields only up to the start of the credit crisis. 
During the credit crisis, credit spreads soared. If that unusual period had been included, 
the results we present, which show that hazard rates calculated from credit spreads are 
higher than those calculated from historical default probabilities, would become much 
more extreme. 

Table 19.5 shows estimates of the average seven-year hazard rates.” The historical 
seven-year hazard rates are calculated so that they are consistent with the cumulative 
default rates in Table 19.4. Suppose that Q is the seven-year cumulative default probability. 
From equation (19.2) 


where A is the average hazard rate so that 


7) = 3 In(1 — Q) 


"The analysis here is an update to that in J. Hull, M. Predescu, and A. White, “Bond Prices, Default 
Probabilities, and Risk Premiums,” Journal of Credit Risk 1, no. 2 (Spring 2005): 53—60. 
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Table 19.5 Average Seven-Year Hazard Rates 


Historical Hazard Hazard Rate from 


Rating Rate (%) Credit Spread (%) Ratio Difference 
Aaa 0.028 0.596 21.4 0.568 
Aa 0.075 0.728 9.7 0.653 
A 0.186 1.145 6.1 0.959 
Baa 0.358 2.126 5.9 1.768 
Ba 1.772 4.671 2.6 2.899 
B 4.864 8.017 1.6 3.153 
Caa 7.814 18.395 2.4 10.581 


% per annum. 


Consider, for example, an A-rated company. The cumulative seven-year default rate is 
1.217% or 0.01217 so that the average hazard rate is 


-2 In(1 — 0.01297) = 0.00186 


or 0.186%. The hazard rates estimated from the credit spreads in Table 19.5 are based on 
equation (19.3). The recovery rate is assumed to be 40%. Consider again A-rated bonds. 
The average seven-year credit spread in Table 19.4 is 68.68 basis points or 0.006868. The 
average seven-year hazard rate is therefore 


0.006868/(1 — 0.4) = 0.01145 


Table 19.5 shows that the hazard rates calculated from credit spreads before the credit 
crisis are higher than those calculated from a long period of historical data. The ratio of 
the hazard rate backed out of bond prices to the hazard rate calculated from historical 
data is high for investment-grade bonds and tends to decline as the credit quality declines. 
By contrast, the difference between the two hazard rates tends to increase as credit quality 
declines.!8 

Table 19.6 provides another way of looking at these results. It shows the excess return 
over the risk-free rate (still assumed to be the seven-year swap rate minus 10 basis points) 
earned by investors in bonds with different credit ratings. Consider again an A-rated 
bond. The average spread over Treasuries is 111 basis points. Of this, 42 basis points 
are accounted for by the average spread between seven-year Treasuries and our proxy 
for the risk-free rate. A spread of 11 basis points is necessary to cover expected defaults. 
(This equals the historical hazard rate in Table 19.5 multiplied by one minus the assumed 


18Other studies have identified a similar difference between the two types of default probability esti- 
mates. See, for example, J. S. Fons, “The Default Premium and Corporate Bond Experience,” Journal 
of Finance 42, no. 1 (March 1987): 81-97, and E. I. Altman, “Measuring Corporate Bond Mortality 
and Performance,” Journal of Finance 44, no. 4 (September 1989): 909-922. 
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Table 19.6 Expected Excess Return on Bonds 


Bond Yield Spread of Risk-Free Spread for Expected 
Spread over Rate over Historical Excess 
Treasuries (bp) Treasuries (bp) Defaults (bp) Return (bp) 
Aaa 78 42 2 34 
Aa 86 42 5 39 
A 111 42 11 58 
Baa 169 42 21 106 
Ba 322 42 106 174 
B 523 42 292 189 
Caa 1,146 42 469 635 


recovery rate of 0.4.) This leaves an expected excess return (after expected defaults have 
been taken into account) of 58 basis points.!” 

Tables 19.5 and 19.6 show that a large percentage difference between default prob- 
ability estimates translates into a relatively small expected excess return on the bond. For 
Aaa-rated bonds, the ratio of the two hazard rates is 21.4, but the expected excess return 
is only 34 basis points. The expected excess return tends to increase as credit quality 
declines. 

The excess return in Table 19.6 does not remain constant through time. Credit 
spreads, and therefore excess returns, were high in 2001, 2002, and the first half of 2003. 
Credit spreads were very high during the crisis and, if excess returns had been calculated 
for the crisis period, they would have been high as well. 


19.7.1 Real-World vs. Risk-Neutral Probabilities 


The risk-neutral valuation argument is explained in Section 7.2. It shows that we can 
value cash flows on the assumption that all investors are risk neutral (that is, on the 
assumption that they do not require a premium for bearing risks). When we do this, we 
get the right answer in the real world as well as in the risk-neutral world. 

The theoretical underpinnings of risk-neutral default probabilities are also explained 
in Section 7.2. These default probabilities, sometimes also called implied default prob- 
abilities, are the ones obtained when credit spreads are used for estimation. By contrast, 
the default probabilities estimated from historical data are real-world default probabil- 
ities (sometimes also called physical default probabilities). The expected excess return 
in Table 19.6 arises directly from the difference between real-world and risk-neutral 
default probabilities. If there was no expected excess return, the real-world and risk- 
neutral default probabilities would be the same, and vice versa. 


'°To avoid complications, the calculations do not make any adjustments for the impact of compound- 
ing frequency issues on spreads, returns, and hazard rates. 
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Why do we see such big differences between real-world and risk-neutral default 
probabilities? As we have just argued, this is the same as asking why corporate bond 
traders earn more than the risk-free rate on average. 

One reason for the results is that corporate bonds are relatively illiquid and the returns 
on bonds are higher than they would otherwise be to compensate for this. But this is a 
small part of what is going on. In normal markets, it explains perhaps 25 basis points of the 
excess return in Table 19.6. Another possible reason for the results is that the subjective 
default probabilities of bond traders are much higher than those given in Table 19.1. 
Bond traders may be allowing for depression scenarios much worse than anything seen 
in the period covered by their data. However, it is difficult to see how this can explain a 
large part of the excess return that is observed” 

By far the most important reason for the results in Tables 19.5 and 19.6 is that bonds 
do not default independently of each other. (To put this another way, default correlation 
is a feature of financial markets.) Evidence for this is that default rates vary markedly 
from year to year. Moody’s statistics (see Table 11.6) show that between 1970 and 2016, 
the default rate per year for all rated companies ranged from a low of 0.088% in 1979 
to a high of 4.996% in 2009. This year-to-year variation in default rates gives rise to 
systematic risk (i.e., risk that cannot be diversified away). Bond traders earn an excess 
expected return for bearing this risk. In this respect, bond traders are no different from 
equity traders. The average extra return earned by equity traders for bearing systematic 
risk is often assumed to be 5% or 6% per year. From Table 19.6 we see that the excess 
return earned by bond traders is much less than this for high-quality bonds. However, 
as the bond’s credit quality decreases, it becomes more like equity and the excess return 
earned tends to increase. 

What causes default correlation and the resultant systematic risk? One explanation 
is the economy. Good macroeconomic conditions decrease the probability of default for 
all companies; bad macroeconomic conditions increase the probability of default for all 
companies. (In Vasicek’s model, which was discussed in Chapter 11, the factor F can be 
regarded as representing the overall health of the economy.) Another explanation is what 
is known as “contagion.” This is discussed in Business Snapshot 19.3. 

In addition to systematic risk, which we have just talked about, there is nonsystem- 
atic (or idiosyncratic) risk associated with each bond. If we were talking about stocks, we 
would argue that investors can diversify the nonsystematic risk by choosing a portfolio of, 
say, 30 stocks. They should not therefore demand a risk premium for bearing nonsystem- 
atic risk. For bonds the arguments are not so clear cut. Bond returns are highly skewed, 
with limited upside. (For example, on an individual bond there might be a 99.75% chance 
ofa 7% return in a year, and a 0.25% chance of a—60% return in the year, the first outcome 
corresponding to no default and the second to default.) The nonsystematic component 


0In addition to producing Table 19.1, which is based on the 1970 to 2016 period, Moody’s produces 
a similar table based on the 1920 to 2016 period. When this table is used, historical hazard rates for 
investment-grade bonds in Table 19.5 rise somewhat. However, the non-investment-grade historical 
hazard rates decline. 
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BUSINESS SNAPSHOT 19.3 
Contagion 


Credit contagion is the process whereby a problem in one sector of the world 
economy leads to problems in other, unrelated sectors. When Russia defaulted 
on its debt in 1998, there was a flight to quality and credit spreads on all bonds 
increased. During the credit crisis that started in 2007, there was a similar flight to 
quality and again credit spreads increased. The accompanying recession led to a 
record number of companies defaulting in 2009. In 2011, problems experienced 
by Greece caused investors to be reluctant to buy the debt of other countries 
such as Spain, Ireland, Portugal, and Italy. As a result, credit spreads on the debt 
issued by these countries increased sharply. 

The reasons for credit contagion have been debated by researchers. Systemic 
risk (see Business Snapshot 15.1) is one potential source of contagion. It may be 
the case that investors become more risk averse when they lose money in one 
sector. It may be the case that problems in one sector lead investors to become 
more pessimistic about other, unrelated sectors. Whatever the reason, entities in 
unrelated sectors are liable to find it more difficult to fund their activities and, as 
a result, may become more likely to default. 


of this risk is difficult to “diversify away’?! It requires tens of thousands of different 
bonds to be held. In practice, many bond portfolios are far from fully diversified. As a 
result, bond traders may earn an extra return for bearing nonsystematic risk as well as for 
bearing the systematic risk mentioned in the previous paragraph. 


19.7.2 Which Estimates Should Be Used? 


At this stage it is natural to ask whether we should use real-world or risk-neutral default 
probabilities in the analysis of credit risk. This question is discussed in a more general 
context in Chapter 7. The answer depends on the purpose of the analysis. When valuing 
credit derivatives or estimating the impact of default risk on the pricing of instruments, 
we should use risk-neutral default probabilities. This is because the analysis calculates the 
present value of expected future cash flows and almost invariably (implicitly or explicitly) 
involves using risk-neutral valuation. When carrying out scenario analyses to calculate 
potential future losses from defaults, we should use real-world default probabilities. For 
example, the probability of default used to calculate regulatory capital is a real-world 
default probability. 


*1See J. D. Amato and E. M. Remolona, “The Credit Spread Puzzle,” BIS Quarterly Review (December 
2003): 51-63. 
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19.8 Using Equity Prices to Estimate Default Probabilities 


When we use a table such as Table 19.1 to estimate a company’s real-world probability 
of default, we are relying on the company’s credit rating. Unfortunately, credit ratings are 
revised relatively infrequently. This has led some analysts to argue that equity prices can 
provide more up-to-date information for estimating default probabilities. 

In 1974, Merton proposed a model where a company’s equity is an option on the 
assets of the company.” Suppose, for simplicity, that a firm has one zero-coupon bond 
outstanding and that the bond matures at time T. Define 


Vo: Value of company’s assets today. 

Vr: Value of company’s assets at time T. 

Ep: Value of company’s equity today. 

E,: Value of company’s equity at time T. 

D: Amount of debt interest and principal due to be repaid at time T. 
6: Volatility of assets (assumed constant). 

Oņç: Instantaneous volatility of equity. 


If Vy < D, it is (at least in theory) rational for the company to default on the debt at 
time T. The value of the equity is then zero. If V; > D, the company should make the 
debt repayment at time T and the value of the equity at this time is Vp — D. Merton’s 
model, therefore, gives the value of the firm’s equity at time T as 


Er = max(V, — D,0) 
This shows that the equity of a company is a call option on the value of the assets 
of the company with a strike price equal to the repayment required on the debt. The 


Black—Scholes—Merton formula (see Appendix E at the end of this book) gives the value 
of the equity today as 


Ey = WN (d) — De N(dp) (19.4) 


where 


I- In(V3/D) + (r + 02,/2)T 
l T 
dy = di -6 VT 


and N is the cumulative normal distribution function. 


?2See R. Merton, “On the Pricing of Corporate Debt: The Risk Structure of Interest Rates,” Journal 
of Finance 29 (1974): 449—470. 
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Under Merton’s model, the company defaults when the option is not exercised. The 
probability of this can be shown to be N(—d,). To calculate this, we require V) and Op. 
Neither of these is directly observable. However, if the company is publicly traded, we 
can observe Ep. This means that equation (19.4) provides one condition that must be 
satisfied by V} and op. We can also estimate Op. From a result in stochastic calculus 
known as Ito’s lemma, 


OE 
SgEo = apo o 


Here 0E/OV is the delta of the equity. From Appendix E it is N(d,), so that 
Op Ey = N(di)o y Vo (19.5) 


This provides another equation that must be satisfied by V} and op. Equations (19.4) 
and (19.5) provide a pair of simultaneous equations that can be solved for V} and op”? 


Example 19.4 


The value of a company’s equity is $3 million and the volatility of the equity is 80%. The 
debt that will have to be paid in one year is $10 million. The risk-free rate is 5% per 
annum. In this case, Ey = 3, 6g = 0.80, r = 0.05, T = 1, and D = 10. Solving equations 
(19.4) and (19.5) yields V) = 12.40 and o,, = 0.2123. The parameter, dy, is 1.1408, so 
that the probability of default is N(—d,) = 0.127 or 12.7%. The market value of the 
debt is V, — Ey or 9.40. The present value of the promised payment on the debt is 
10e~°-*! = 9.51. The expected loss on the debt is therefore (9.51 — 9.40) /9.51 or about 
1.2% of its no-default value. 


19.8.1 Extensions of the Basic Model 


The basic Merton’s model we have just presented has been extended in a number of ways. 
For example, one version of the model assumes that a default occurs whenever the value 
of the assets falls below a barrier level. Another allows payments on debt instruments to 
be required at more than one time. Many analysts have found the implied volatility of 
equity issued by a company to be a good predictor of the probability of default. (The 
higher the implied volatility, the higher the probability of default.) Hull et al. (2004) 
show that this is consistent with Merton’s model. They provide a way of implementing 
Merton’s model using two equity implied volatilities and show that the resulting model 
provides results comparable to those provided by the usual implementation of the model. 


3To solve two non-linear equations of the form F(x, y) = 0 and G(x, y) = 0, we can use the Solver 
routine in Excel to find the values of x and y that minimize [F(x, y)]? + [G(x, y)]’. 

See J. Hull, I. Nelken, and A. White, “Merton’s Model, Credit Risk, and Volatility Skews,” Journal of 
Credit Risk 1, no. 1 (2004): 1-27. 
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19.8.2 Performance of the Model 


How well do the default probabilities produced by Merton’s model and its extensions 
correspond to actual default experience? The answer is that Merton’s model and its 
extensions produce a good ranking of default probabilities (risk-neutral or real-world). 
This means that a monotonic transformation can be estimated to convert the proba- 
bility of default output from Merton’s model into a good estimate of either the real- 
world or risk-neutral default probability. Moody’s KMV and Kamakura provide a ser- 
vice that transforms a default probability produced by Merton’s model into a real-world 
default probability. CreditGrades uses Merton’s model to estimate credit spreads, which 
are closely linked to risk-neutral default probabilities. The default probability, N(—d,), is 
in theory a risk-neutral default probability because it is calculated from an option pric- 
ing model. It may seem strange for Moody’s KMV and Kamakura to use it to estimate 
a real-world default probability. Given the nature of the calibration process we have just 
described, the underlying assumption is that the rankings of risk-neutral default proba- 
bilities, real-world default probabilities, and default probabilities produced by Merton’s 
model are all the same. 


19.8.3 Real-World vs. Risk-Neutral Default Probabilities 


Merton’s model provides a way of understanding why default probabilities are higher 
in the risk-neutral world than in the real world. In a risk-neutral world, the expected 
growth rate of the value of the company’s assets is the risk-free rate. In the real world, the 
growth rate of the company’s assets is usually higher than this (reflecting a risk premium 
demanded by the market). The probability of the value of the assets dropping below the 
face value of the debt at a future time is therefore higher in the risk-neutral world than 
in the real world. 


19.8.4 Distance to Default 


The term distance to default has been coined to describe the output from Merton’s model. 
This is the number of standard deviations the asset price must change for default to be 
triggered T years in the future. One definition of distance to default is d, or 


In -1n D+ (r -— 0?,/2)T 
SNT 


As the distance to default declines, the company becomes more likely to default. In 
Example 19.4, the one-year distance to default is 1.14 standard deviations. 


Summary 


The estimation of default probabilities and recovery rates is an important activity for risk 
managers. Ifa company has issued publicly traded debt, credit ratings provide one source 
of information. Rating agencies such as Moody’s provide extensive statistics on default 
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rates for companies that have been awarded particular credit ratings. The recovery rate is 
the value of a bond shortly after default as a percentage of its face value. Rating agencies 
provide statistics on recovery rates for different types of bonds. 

There are a number of sources of information about credit spreads. The credit default 
swap (CDS) market is one such source. A CDS is an instrument where one company 
buys from another company protection against a third company or country (the reference 
entity) defaulting on its obligations. The CDS spread is the amount paid per year for 
protection as a percentage of the notional principal. Two other sources of a company’s 
credit spreads are the excess of yields on bonds issued by the company over the risk- 
free rate and asset swap spreads. Risk-neutral default probabilities can be calculated from 
credit spreads and an assumption about recovery rates. 

The default probabilities that are based on historical data, such as those produced 
by rating agencies, are termed real-world or physical default probabilities. Risk-neutral 
default probabilities are higher than real-world default probabilities. Risk-neutral default 
probabilities should be used for valuation. Real-world default probabilities should be used 
for scenario analysis. Either type of probability can be calculated using a model developed 
by Robert Merton in 1974, provided that the model is calibrated appropriately. 
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Practice Questions and Problems 
(Answers at End of Book) 


19.1 How many different ratings does Moody’s use for investment-grade companies? 
What are they? 
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19.2 


19.3 


19.4 


19.5 


19.6 
19.7 
19.8 


19.9 


19.10 


19.11 


19.12 
19.13 


19.14 


19.15 


How many different ratings does S&P use for investment-grade companies? What 
are they? 

Calculate the average hazard rate for a B-rated company during the first year from 
the data in Table 19.1. 

Calculate the average hazard rate for a Ba-rated company during the third year 
from the data in Table 19.1. 

A credit default swap requires a semiannual payment at the rate of 60 basis points 
per year. The principal is $300 million and the credit default swap is settled in 
cash. A default occurs after four years and two months, and the calculation agent 
estimates that the price of the cheapest deliverable bond is 40% of its face value 
shortly after the default. List the cash flows and their timing for the seller of the 
credit default swap. 

Explain the two ways a credit default swap can be settled. 

Explain the difference between risk-neutral and real-world default probabilities. 

What is the formula relating the payoff on a CDS to the notional principal and 
the recovery rate? 

The spread between the yield on a three-year corporate bond and the yield on 
a similar risk-free bond is 50 basis points. The recovery rate is 30%. Estimate the 
average hazard rate per year over the three-year period. 

The spread between the yield on a five-year bond issued by a company and the 
yield on a similar risk-free bond is 80 basis points. Assuming a recovery rate of 
40%, estimate the average hazard rate per year over the five-year period. If the 
spread is 70 basis points for a three-year bond, what do your results indicate about 
the average hazard rate in years 4 and 5? 

Should researchers use real-world or risk-neutral default probabilities for (a) cal- 
culating credit value at risk and (b) adjusting the price of a derivative for default? 
How are recovery rates usually defined? 

Verify (a) that the numbers in the second column of Table 19.5 are consistent 
with the numbers in the second column of Table 19.4 and (b) that the numbers 
in the fourth column of Table 19.6 are consistent with the numbers in Table 19.5 
and a recovery rate of 40%. 

A four-year corporate bond provides a coupon of 4% per year payable semi- 
annually and has a yield of 5% expressed with continuous compounding. The 
risk-free yield curve is flat at 3% with continuous compounding. Assume that 
defaults can take place at the end of each year (immediately before a coupon or 
principal payment) and the recovery rate is 30%. Estimate the risk-neutral default 
probability on the assumption that it is the same each year using the approach in 
Table 19:3. 

A company has issued three- and five-year bonds, each of which has a coupon of 
4% per annum payable annually. The yields on the bonds (expressed with contin- 
uous compounding) are 4.5% and 4.75%, respectively. Risk-free interest rates are 
3.5% with continuous compounding for all maturities. The recovery rate is 40%. 
Defaults can take place halfway through each year. The unconditional risk-neutral 
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19.16 


19.17 


19.18 


19.19 


19.20 


19.21 
19.22 


19.23 


19.24 


default rates per year are Q, for years 1 to 3 and Q, for years 4 and 5. Estimate 
Q; and Q,. 

Suppose that in an asset swap, B is the market price of the bond per dollar of 
principal, B* is the default-free value of the bond per dollar of principal, and V 
is the present value of the asset swap spread per dollar of principal. Show that 
V=B-B. 

Show that, under Merton’s model in Section 19.8, the credit spread on a T-year 
zero-coupon bond is — In[N (d>) + N(—d,)/L]/T where L = De~'T/V,. 

The value of a company’s equity is $2 million and the volatility of its equity 
is 50%. The debt that will have to be repaid in one year is $5 million. The 
risk-free interest rate is 4% per annum. Use Merton’s model to estimate the 
probability of default. (Hint: The Solver function in Excel can be used for this 
question.) 

A five-year credit default swap entered into on June 20, 2013, requires quarterly 
payments at the rate of 400 basis points per year. The principal is $100 million. 
A default occurs after four years and two months. The auction process finds the 
price of the cheapest deliverable bond to be 30% of its face value. List the cash 
flows and their timing for the seller of the credit default swap. 

“The position of a buyer of a credit default swap is similar to the position of 
someone who is long a risk-free bond and short a corporate bond.’ Explain this 
statement. 

Why is there a potential asymmetric information problem in credit default swaps? 
Suppose that the LIBOR/swap curve is flat at 6% with continuous compounding 
and a five-year bond with a coupon of 5% (paid semiannually) sells for 90.00. 
How much would the bond be worth if it were a risk-free bond? What is the 
present value of the expected loss from defaults? How would an asset swap on the 
bond be structured? What is the asset swap spread that would be calculated in this 
situation? 


Further Questions 


Suppose that a three-year corporate bond provides a coupon of 7% per year 
payable semiannually and has a yield of 5% (expressed with semiannual com- 
pounding). The yield for all maturities on risk-free bonds is 4% per annum 
(expressed with semiannual compounding). Assume that defaults can take place 
every six months (immediately before a coupon payment) and the recovery rate 
is 45%. Estimate the default probabilities assuming (a) the unconditional default 
probabilities are the same on each possible default date and (b) the default prob- 
abilities conditional on no earlier default are the same on each possible default 
date. 

A company has issued one- and two-year bonds providing 8% coupons, payable 
annually. The yields on the bonds (expressed with continuous compounding) 
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19.25 


are 6.0% and 6.6%, respectively. Risk-free rates are 4.5% for all maturities. The 
recovery rate is 35%. Defaults can take place halfway through each year. Estimate 
the risk-neutral default rate each year. 

The value of a company’s equity is $4 million and the volatility of its equity is 
60%. The debt that will have to be repaid in two years is $15 million. The risk-free 
interest rate is 6% per annum. Use Merton’s model to estimate the expected loss 
from default, the probability of default, and the recovery rate (as a percentage of 
the no-default value) in the event of default. (Hint: The Solver function in Excel 
can be used for this question as indicated in footnote 23.) 


Chapter 20 


CVA and DVA 


s explained in Chapter 17, there are two ways in which over-the-counter 

derivatives transactions are cleared: through central clearing parties (CCPs) and 

through bilateral agreements. As a result of regulations introduced since the 
credit crisis, transactions between financial institutions in standardized derivatives have 
to be cleared through CCPs. Transactions in nonstandard derivatives between financial 
institutions are cleared bilaterally, but regulations being introduced will require a great 
deal of collateral to be posted so that there will be virtually no credit risk. This chapter 
focuses on bilaterally cleared over-the-counter derivatives transactions that entail credit 
risk. This includes transactions with non-systemically important end users and trans- 
actions with other counterparties that were entered into before the new rules came 
into effect. 

Assessing the credit risk for derivatives transactions is much more complicated than 
assessing the credit risk for loans because the future exposure (i.e., the amount that could 
be lost in the event of a default) is not known. Ifa bank makes a $10 million uncollater- 
alized five-year loan to a client with repayment of principal at the end, the bank knows 
that its exposure is approximately $10 million at all times during the five-year period. If 
the bank instead enters into a five-year uncollateralized interest rate swap with the client, 
the future exposure is much less certain. This is because the future value of the swap 
depends on movements in interest rates. If the value of the interest rate swap to the bank 
becomes positive, the exposure is equal to the value of the swap (because this is what the 
bank could lose in the event of a counterparty default). If the value becomes negative, 
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the exposure is zero (because in that case the bank would not lose anything in the case 
of a counterparty default). 

The chapter explains the credit value adjustment (CVA), which is the expected loss 
to a derivatives dealer from a default by the counterparty, and the debit (or debt) value 


adjustment (DVA), which is the expected gain to the dealer (loss to the counterparty) 
from a default by the dealer. 


20.1 Credit Exposure on Derivatives 


We start by quickly reviewing the nature of a dealer’s exposure to a counterparty cre- 
ated by derivatives transactions. Consider first the simple case where there is only one 
derivatives transaction outstanding between a derivatives dealer and a counterparty. We 
assume that the transaction is cleared bilaterally and no collateral is posted by either the 
dealer or the counterparty. 

Three possible situations can be distinguished: 


1. The derivative is always a liability to the dealer. 
2. The derivative is always an asset to the dealer. 
3. The derivative can become either an asset or a liability to the dealer. 


An example of a derivative in the first category is a short position in an option; an 
example in the second category is a long position in an option; an example in the third 
category is a forward contract or an interest rate swap transaction. 

In the first case, the dealer has no credit exposure to the counterparty. If the counter- 
party defaults (e.g., because it is bankrupt), the transaction is terminated.’ As explained 
in Section 17.1.5, the settlement will require the dealer to pay the negative value of 
the derivative (adjusted to reflect the bid—offer spread costs that the dealer will incur 
in replacing the transaction) to the estate of the defaulting counterparty. There should 
therefore be neither a gain nor a loss to the dealer. 

In the second case, the dealer always has credit exposure to the counterparty. If the 
counterparty defaults, the dealer is liable to experience a loss. The dealer is an unsecured 
creditor for the value of the derivative (adjusted to reflect the bid—offer spread costs that 
the dealer will incur in replacing the transaction). The amount of the loss depends on 
the value of the derivative at the time of the bankruptcy. 

The third case is more complicated. The dealer may or may not have credit exposure 
to the counterparty in the future. If the counterparty defaults when the value of the 
derivative is positive to the dealer, the dealer is an unsecured creditor, as in the second 


'Note that, because the derivative is an asset to the counterparty, it must be the case that, if 
the counterparty goes bankrupt, it has run into financial difficulties for reasons unrelated to the 
derivative. 
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case. If the counterparty defaults when the value is negative to the dealer, no loss or gain 
to the dealer results, as in the first case. 

When calculating their exposures, dealers typically do not include bid-offer spread 
adjustments mentioned at the end of Section 17.1.5. In the simple example we have been 
considering, the dealer’s net exposure is therefore 


max(V, 0) 


where the variable, V,is the market value of the derivative at the time of the default. 

When there are many derivatives transactions outstanding between the dealer and 
the counterparty, the bilateral agreement between the dealer and its counterparties means 
that they are netted (see Section 15.5). The dealer’s exposure at any given time, assuming 
no collateral is posted, is therefore max(V’, 0), with V equal to the net mark-to-market 
value of all outstanding transactions. 

In calculating its credit risk, the dealer is interested in both its credit exposure today 
and what that exposure might be in the future. Regulators have recognized this. As 
explained in Section 15.3, Basel I used the CEM approach where the loan-equivalent 
exposure is the current exposure plus an add-on amount. A version of this was used 
to calculate exposure at default (EAD) in Basel II and was revised in SA-CCR (see 
Section 16.4). The procedure for calculating EAD in the Advanced IRB approach of 
Basel II involves calculating what is termed the Effective Expected Positive Exposure 
(see Section 15.8.4). 

In this chapter we focus on the calculation of CVA and DVA, which are estimates of 
expected losses from possible defaults. 


20.2 CVA 


A dealer calculates what is known as a credit value adjustment or CVA for each counterparty 
with which it has bilaterally cleared OTC derivatives.” This is an estimate of its expected 
loss from a default by the counterparty. CVA reduces the value of the derivatives on the 
balance sheet, and an increase (decrease) in the total CVA during a period leads to a 
decrease (or increase) in profits for the period. 

Suppose that T is the life of the longest derivative outstanding with a counterparty. 
To calculate CVA, the period of time between time zero (today) and time T is divided 
into a number of intervals (e.g., zero to one month, one to three months, etc.). 


? For further reading on CVA and the way it is calculated, see J. Hull and A. White, “CVA and Wrong 
Way Rusk,” Financial Analysts Journal 65, no. 5 (September/October 2012): 58-69; E. Canabarro and 
D. Duffie, “Measuring and Marking Counterparty Risk,’ Chapter 9 in Asset/Liability Management for 
Financial Institutions, ed. L. Tilman (New York: Institutional Investor Books, 2003); E. Picault, “Calcu- 
lating and Hedging Exposure, CVA, and Economic Capital,” in Counterparty Credit Risk Modeling, ed. 
M. Pykhtin (London: Risk Books, 2005); and J. Gregory, Counterparty Credit Risk: The New Challenge 
for Financial Markets, 2nd ed. (Hoboken, NJ: John Wiley & Sons, 2012). 


462 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


Suppose that the ith time interval runs from f,_, to t; (tọ = 0) and there are n intervals. 
Define 


q;: the risk-neutral probability of a loss from a default by the counterparty during the ith 
time interval 

v;: the present value of the expected net exposure of the dealer to the counterparty (after 
collateral) at the midpoint of the ith time interval, conditional on a default 

R: the estimated recovery rate of the dealer in the event of a default by the counterparty 
when it is an unsecured creditor 


Assuming that the net exposure is independent of the probability of default, 
the present value of the expected loss from a default during the ith interval can be 
estimated as 


(1 — R)qjy; 


and the total expected loss is 


CVA = È (1 = R)qir; (20.1) 


i=1 


The q; are risk-neutral default probabilities. (This is because the calculation of CVA 
involves the valuation of potential future cash flows and, as explained in Chapter 7, it 
is correct to use risk-neutral, rather than real-world, default probabilities for valuation.) 
They are estimated from the counterparty’s credit spreads, as discussed in Section 19.6. 
Suppose that s; is an estimate of the counterparty’s credit spread for a maturity of t,. From 
equation (19.3), an estimate of the average hazard rate between time zero and time t; is 


) = 20.2 
=R (20.2) 


and the probability of no default between times zero and t; is 


so that 


or, substituting from equation (20.2) 


Si 8-4 Sib; (20.3 
. = exp | — — exp | — i 
KERNER rte ) 


The v; are usually calculated using Monte Carlo simulation. The market variables 


determining the future value of the transactions that the dealer has with the counterparty 
are simulated in a risk-neutral world between time zero and time T. On each simulation 
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trial the exposure of the dealer to the counterparty at the midpoint of each interval 
is calculated. These exposures are averaged and v; is set equal to the present value of 
the average exposure at the midpoint of the ith interval. Dealers may have transactions 
with thousands of counterparties so that the calculation of the v; for all of them can be 
computationally very intensive. 


20.2.1 Collateral and Cure Periods 


Collateral agreements must be incorporated into the calculation of the v;. A collateral 
agreement between two sides is in the credit support annex (CSA) of their ISDA master 
agreement. This gives the details on how the collateral posted by each side is calculated 
and specifies haircuts on noncash collateral. Suppose that C is defined as the collateral 
posted by the counterparty at the time of a default. (If C is negative, —C is the collateral 
posted by the dealer with the counterparty at the time of the default.) The exposure, E, 
of the dealer to the counterparty is in all situations 


E = max(V — C, 0) (20.4) 


where V is the market value of outstanding transactions to the dealer at the time of the 
default. 

Equation (20.4) reflects the fact that collateral posted by the counterparty can be used 
to reduce (and possibly eliminate) an exposure from a positive V. If the dealer has posted 
collateral with the counterparty (C < 0) it will not be returned and, to the extent that 
it is greater than the value —V of the transactions to the counterparty, it is an exposure. 
As mentioned, in CVA and DVA calculations exposure is usually defined in terms of the 
mid-market value of transactions. (In Section 17.1.5, we explained that for the purposes 
of the claim made in the event of default, mid-market values are adjusted by half the 
bid-offer spread.) 

It is usually assumed that a period of time elapses between the time when a coun- 
terparty stops posting collateral and the close-out of transactions. This period of time is 
known as the cure period (or margin period of risk). It is typically 10 or 20 days. The effect 
of the cure period is that the collateral at the time of a default does not reflect the value 
of the portfolio at the time of the default. It reflects the value 10 or 20 days earlier. 

Suppose that the midpoint of the ith interval is t* so that f* = (t,_, + t)/2 and c is 
the cure period. The Monte Carlo simulation to calculate the v; must be structured so 
that the value of the derivatives portfolio with the counterparty is calculated at times 
t* —c as well as at time f° (i= 1,2, ..., n). On each simulation trial the value at time 
t* — cis used in conjunction with the CSA to calculate the collateral available at time f°. 
The net exposure at time f* is then calculated using equation (20.4) 3 The calculations 
are illustrated in Example 20.1. 


3 This is a simplification. It assumes that the non-defaulting party does not post any collateral or return 
any collateral during the cure period. 
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Example 20.1 


There is a two-way zero threshold collateral agreement between a bank and its coun- 
terparty. This means that the total collateral posted by one side with the other side on 
a day is specified as max(V, 0), where V is the value of the outstanding transactions to 
the other side. The cure period is 20 days. This means that the collateral posted by one 
side at the time of a default can be assumed to equal max(V, 0) where V is the value to 
the other side 20 days earlier. Suppose that time T is the midpoint of one of the intervals 
used in the bank’s CVA calculation. 


1. On a particular simulation trial, the value of outstanding transactions to the bank 
at time T is 50 and their value 20 days earlier is 45. The calculation assumes 
that the bank has collateral worth 45 in the event of a default at time t. The 
bank’s exposure is therefore 5, the uncollateralized value it has in the derivatives 
transactions. 

2. On a particular simulation trial, the value of outstanding transactions to the bank at 
time T is 50 and their value 20 days earlier is 55. In this case, it is assumed that the 
bank will have adequate collateral and its exposure is zero. 

3. On a particular simulation trial, the value of outstanding transactions to the bank at 
time T is —50 and the value 20 days earlier is —45. In this case, the bank is assumed to 
have posted less than 50 of collateral with the counterparty in the event of a default 
at time T and its exposure is zero. 

4. On a particular simulation trial, the value of outstanding transactions to the bank at 
time T is —50 and the value 20 days earlier is —55. In this case, it is assumed that 55 
of the bank’s collateral is held by the counterparty 20 days before time T and, in the 
event of a default at time T, none of it is returned. The bank’s exposure is therefore 
5, the excess collateral it has posted. 


20.2.2 Peak Exposure 


In addition to calculating CVA, dealers usually calculate the peak exposure at the midpoint 
of each interval. This is a high percentile of the exposures given by the Monte Carlo 
simulation trials. Suppose for example that the percentile chosen is 97.5% and there are 
10,000 Monte Carlo simulation trials. The peak exposure at time f is the 250th highest 
exposure recorded at that time. The maximum peak exposure is the maximum of the peak 
exposures across all the f*. 

There is a theoretical issue here (which is often ignored in practice). To calculate 
CVA, we simulate the behavior of market variables in the risk-neutral world and discount 
at the risk-free rate. (As explained in Chapter 7, it is correct to do this when valuation is 
the objective.) When we calculate peak exposure, we are carrying out a scenario analysis. 
We are in effect asking, “How bad can our exposure to the counterparty get in the 
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future?” For this purpose, as explained in Chapter 7, we should in theory simulate the 
behavior of market variables in the real world, not the risk-neutral world* 


20.2.3 Downgrade Triggers 


Netting and collateral agreements are important ways in which credit risk is reduced 
in bilaterally cleared derivatives transactions. Sometimes the credit support annex to an 
ISDA master agreement also includes a downgrade trigger. This is a clause stating that if 
the credit rating of one side falls below a certain level, the other side has the right to ask 
for more collateral or to terminate transactions. 

AIG provides an example of the operation of a downgrade trigger. Many of AIG’s 
transactions stated that AIG did not have to post collateral provided its credit rating 
remained above AA. However, once it was downgraded below AA, collateral was required. 
AIG was downgraded below AA by all three rating agencies on September 15, 2008. This 
led to collateral calls, which AIG was unable to meet. Its bankruptcy was avoided by a 
massive government bailout. 

Other companies that have run into problems because of downgrade triggers include 
Enron, Xerox, and Ambac. Downgrade triggers do not provide protection to a company’s 
counterparties from a big jump in a company’s credit rating (for example, from AA to 
default). Also, downgrade triggers work well for a company’s counterparties only when 
the company is making relatively little use of them. Ifa company has many downgrade 
triggers in its contracts, an AlG-type situation can arise where a downgrade below a 
specified credit rating can lead to huge cash demands on the company. If these cash 
demands cannot be met, an immediate bankruptcy follows. 


20.3 The Impact of a New Transaction 


When a new derivatives transaction is being negotiated by a dealer with a counterparty, 
its incremental effect on CVA may influence the terms offered. If the value of the new 
transaction is positively correlated with other transactions entered into by the dealer with 
the counterparty, the incremental effect on CVA is likely to be positive. If this correlation 
is negative, the new transaction can have the effect of reducing CVA. 

Suppose, for example, that a dealer has one transaction outstanding with a counter- 
party: a five-year forward foreign currency transaction where the counterparty is buying 
the currency. If the counterparty then expresses interest in entering into a three-year 
forward foreign currency transaction, the competitiveness of the dealer’s quote might 
depend on which side of the transaction the counterparty wants to take. If the counter- 
party wants to buy the currency, the new transaction would have the effect of increasing 


+ When the peak exposure is being calculated for times that are not far in the future, the difference 
between simulating in the real world and in the risk-neutral world is usually quite small. 
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CVA (making the transaction less attractive to the dealer). On the other hand, if the coun- 
terparty wants to sell the currency, it would be netted with the existing transaction and 
have the effect of reducing CVA (making the transaction more attractive to the dealer). 
These CVA/netting considerations suggest that when transactions are cleared bilaterally 
and are not fully collateralized, a company will sometimes get the most favorable quotes 
from a dealer with whom it already has transactions outstanding rather than from a dealer 
with whom it has done no previous business. 

Calculating CVA is computationally intensive. Often dealers have hundreds or thou- 
sands of transactions outstanding with a counterparty. Calculating the incremental effect 
of a new transaction on CVA by recomputing CVA is not usually feasible. Luckily there 
is a computationally efficient approach for calculating incremental CVA. 

When the CVA calculations in Section 20.2 are carried out, the paths followed by 
market variables on each simulation trial and the value of the portfolio on each simu- 
lation trial are stored When a potential new transaction is being considered, its value 
at future times is calculated for the values of the market variables that were obtained on 
the simulation trials. This gives the incremental impact of the transaction on the future 
portfolio value for each of the last-used Monte Carlo simulation trials at all future times. 
The incremental impact on the exposure at each time for each Monte Carlo trial can 
then be calculated. From this, the incremental effect on the average exposure at each 
time can be calculated and equation (20.1) can then be used to calculate the incremental 
effect on CVA. 

To illustrate this, suppose that a portfolio with a counterparty depends only on the 
price of gold and that the price of gold at time 2.5 years on the 545th simulation trial 
when CVA is calculated is $1,572, with the value of the portfolio to the dealer being 
$2.4 million. Assuming no collateral is posted, this is also the value of the exposure. If 
2.5 years is the midpoint of the 20th time step, this means that v,, is the present value of 
$2.4 million received at time 2.5 years. We suppose that this is $2.3 million. 

Suppose that, shortly after CVA is calculated, a potential new transaction with the 
counterparty, dependent on the price of gold, is contemplated. This transaction is valued 
at all times for the paths followed by the price of gold on all simulation trials. Suppose 
that, on the 545th simulation trial, it is calculated that the value of the new transaction 
will be -$4.2 million at time 2.5 years (when the price of gold is $1,572). This means that 
the portfolio value reduces from $2.3 million to —$1.9 million at time 2.5 years on the 
545th simulation trial as a result of the new transaction. This reduces the exposure to zero 
so that the new v» is zero. The new transaction therefore has the effect of reducing vzo 
by $2.3 million. Similar calculations are carried out for all simulation trials and all times. 


5 Lehman Brothers, for example, had about 1.5 million derivatives transactions with about 8,000 coun- 
terparties at the time of its failure. 

6 To be precise, the values of all market variables at the t“ and t“ —c for 1<i<nand the MTM 
portfolio values at these times are stored until the next time CVA is calculated (at which time they 
can be deleted). 
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Define Av, as the average change in v; across all the simulation trials. The incremental 
effect on CVA of the new transaction is estimated as 


n 


DY = R)giAy; 


i=1 


20.4 CVA Risk 


A dealer has one CVA for each counterparty. These CVAs can themselves be regarded as 
derivatives. They are particularly complex derivatives. Indeed, the CVA for a counterparty 
is more complex than any of the transactions between the dealer and the counterparty 
because it is contingent on the net value of all the transactions between the dealer and 
the counterparty. 

When CVA increases (decreases), the income reported by a derivatives dealer 
decreases (increases). For this reason, many dealers consider it prudent to try to hedge 
CVAs in the same way that they hedge other derivatives. This means that they must 
calculate the Greek letters (delta, gamma, vega, etc.) discussed in Chapter 8. 

The variables affecting the v, are market variables such as interest rates, exchange 
rates, commodity prices, and so on. Calculating Greek letters for these is liable to be 
computationally quite time consuming. For example, to calculate the delta of CVA with 
respect to an exchange rate, it is necessary to make a small change to the exchange rate 
and recompute CVA. A technique known as adjoint differentiation can be used to ease 
the computational burden.’ 

The variables affecting the q; are the credit spreads of the counterparty for different 
maturities. From equation (20.3) 


From equation (20.1) 


n 


CVA = (1 = R)qv; 


i=1 


7 See, for example, M. Giles and P. Glasserman, “Smoking Adjoints: Fast Monte Carlo Greeks,” Risk 
19, no. 1 (2006): 88-92; and M. Henrard, “Adjoint Algorithmic Differentiation: Calibration and the 
Implicit Function Theorem,” Journal of Computational Finance 17, no. 4 (2014): 37—47. 
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Using a delta/gamma approximation, the change in CVA resulting from a small par- 
allel shift, As, in the term structure of credit spreads (with all the market variables deter- 
mining the v; being assumed to remain fixed) is therefore 


n P at 
A(CVA) = > h exp (-; 1) — t;_1 exp (-4)| v,As 


i=1 


This can be calculated without difficulty once the v; are known. 

Basel III requires dealers to use this equation to incorporate the risks arising from 
changes in credit spreads into market risk capital calculations. Originally, risks arising 
from changes in the market variables affecting the v; were not included in market risk 
capital calculations. This had the effect of discouraging dealers from hedging the risks 
associated with the v; because the impact of hedging would be to increase rather than 
reduce the market risk assessed for capital purposes. This issue was addressed by the Basel 
Committee in 20153 


20.5 Wrong-Way Risk 


Up to now we have assumed that the probability of default is independent of the exposure. 
A situation where there is a positive dependence between the two, so that the probability 
of default by the counterparty tends to be high (low) when the dealer’s exposure to the 
counterparty is high (low), is referred to as wrong-way risk. A situation where there is 
negative dependence, so that the probability of default by the counterparty tends to be 
high (low) when the dealer’s exposure to the counterparty is low (high), is referred to as 
right-way risk. 

A subjective evaluation of the amount of wrong-way or right-way risk in transac- 
tions with a counterparty requires a good knowledge of the counterparty’s business, in 
particular the nature of the risks facing the business. It also requires knowledge of the 
transactions the counterparty has entered into with other dealers. The latter is difficult 
to know with any precision. 

One situation in which wrong-way risk tends to occur is when a counterparty is 
using a credit default swap to sell protection to the dealer. (AIG is an example here.) When 
a dealer buys protection from the counterparty and the credit spread of the reference 
entity increases, the value of the protection to the dealer becomes positive. However, 
because the credit spreads of different companies tend to be correlated, it is likely that 


8See Basel Committee on Banking Supervision, “Review of Credit Valuation Adjustment Risk 
Framework,’ October 1, 2015. 
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the credit spread of the counterparty has increased so that the counterparty’s calculated 
probability of default has also increased. Similarly, right-way risk tends to occur when a 
counterparty is buying credit protection from the dealer. 

A situation in which a company is speculating by entering into many similar trades 
with one or more dealers is likely to lead to wrong-way risk for these dealers. This is 
because the company’s financial position and therefore its probability of default is likely 
to be affected adversely if the trades move against the company. 

If a company enters into transactions with a dealer to partially hedge an existing 
exposure, there should in theory be right-way risk. This is because, when the transactions 
move against the counterparty, it will be benefiting from the unhedged portion of its 
exposure so that its probability of default should be relatively low? 

A simple way of dealing with wrong-way risk is to use what is termed the “alpha” 
multiplier to increase v; in the version of the model where the v; and q; are assumed to 
be independent. The effect of this is to increase CVA by the alpha multiplier. Basel rules 
set alpha equal to 1.4, but allow banks to use their own models, with a floor for alpha 
of 1.2. This means that, at minimum, the CVA has to be 20% higher than that given by 
the model where the v; and q; are assumed to be independent. If a bank does not have 
its own model for wrong-way risk, it has to be 40% higher. Estimates of alpha reported 
by banks range from 1.07 to 1.10. 

Some models have been developed to capture the dependence of the probability of 
default on the exposure. For example, Hull and White (2012) propose a simple model 
where the hazard rate at time f is a function of variables observable at that time.'° Their 
parameter describing the extent of the dependence can be either estimated subjectively 
or estimated by relating past credit spreads for the counterparty to what the value of 
the current portfolio would have been in the past. Implementing the model involves 
relatively minor modifications to the calculations outlined in Section 20.2. 


20.6 DVA 


Debit value adjustment (DVA) is the mirror image of CVA.'' Whereas CVA is the 
expected cost to the dealer from a possible default by the counterparty, DVA is the 
expected cost to the counterparty because the dealer might default. It is the counter- 
party’s CVA. If DVA is a cost to the counterparty, it must be a benefit to the dealer. (This 
is because derivatives are zero sum games: the gain to one side always equals the loss to 
the other side.) The benefit arises from the fact that the dealer when it defaults avoids 
payments that would otherwise be required on outstanding derivatives. Accounting 


? An exception could be when the counterparty is liable to run into liquidity problems. Although the 
assets being hedged have increased in value, the counterparty might be unable to post collateral when 
required. An example here is Ashanti Goldfields (see Business Snapshot 24.2). 

10 See J. Hull and A. White, “CVA and Wrong Way Risk,” Financial Analysts Journal 68, no. 5 (Septem- 
ber/October 2012): 58-69. 

'IDVA is sometimes also referred to as “debt value adjustment.” 
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standards recognize both CVA and DVA. The book value of the derivatives outstanding 
with a counterparty to be calculated is 


fa — CVA + DVA 


where f q is the value of the derivatives assuming neither side will default. 

DVA can be calculated at the same time as CVA. Equation (20.1) gives DVA rather 
than CVA if R is the recovery rate of the counterparty in the event of a default by the 
dealer, v; is the present value of the counterparty’s exposure to the dealer at the midpoint 
of the ith interval, and q; is the probability of a default by the dealer during the ith 
interval. The counterparty’s net exposure to the dealer takes account of collateral posted 
by the dealer with the counterparty in the same way that the dealer’s net exposure to the 
counterparty takes account of the collateral posted by the counterparty with the dealer. 

One surprising effect of DVA is that when the credit spread of a derivatives dealer 
increases, DVA increases. This in turn leads to an increase in the reported value of the 
derivatives on the books of the dealer and a corresponding increase in its profits. Some 
banks reported several billion dollars of profits from this source in the third quarter of 
2011. Regulators are uncomfortable with this and have excluded DVA gains and losses 
from the definition of common equity in the determination of regulatory capital. 


20.7 Some Simple Examples 


To illustrate the ideas presented in this chapter, we now present some simple examples. 


20.7.1 Single Transaction with Positive Value 


Suppose first that a dealer has a single derivatives transaction with a counterparty that 
is bound to have a positive value to the dealer and a negative value to the counterparty 
at all future times. (An example would be the situation where the dealer has bought 
an option from the counterparty.) We suppose that no collateral has to be posted by the 
counterparty and, for ease of exposition, assume that payoffs on the derivatives transaction 
occur only on its expiration date. 

The exposure of the dealer to the counterparty at a future time is the value of the 
transaction at that time. The present value of the expected exposure at time ¢,, which we 
have denoted by v;, is therefore the present value of the expected value of the transaction 
at time t;. Because we are assuming no payoffs before maturity, the present value of the 
expected value of the transaction at time t; is always equal to its value today. 

Equation (20.1) therefore becomes 


CVA = (1-R)f, >a, 
i=1 


where fọ is the value of the derivative today, assuming no defaults. If f," is the value after 
defaults are taken into account: 


Sy =f — CVA 
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or 
=h i -(1-R) Ý, | (20.6) 
i=1 


This means that the effect of defaults is to reduce the value of the derivative by a pro- 
portional amount equal to the cumulative risk-neutral probability of default during the 
life of the derivative times one minus the recovery rate. 

Now consider an unsecured zero-coupon bond issued by the counterparty that 
promises $1,000 at time T. Define Bọ as the value of the bond assuming no possibil- 
ity of default and By as the actual value of the bond. Assuming that the bond ranks 
equally with the derivative in the event of a default, it will have the same recovery rate. 
Similarly to equation (20.6) 


=D, i -(1-R) Ý, | (20.7) 
i=1 


From equations (20.6) and (20.7) 


fy _ Bo (20.8) 
Jo By 


If y is the yield on a risk-free zero-coupon bond maturing at time T and y* is the yield on 


a zero-coupon bond issued by the counterparty that matures at time T, then By = e~!7 


and By = e™T so that equation (20.8) gives 


fi =p ro (20.9) 


This shows that the derivative can be valued by increasing the discount rate that is applied 
to the expected payoff by y* — y. 
Example 20.2 


Consider a two-year uncollateralized over-the-counter option sold by a company with 
a value, assuming no possibility of default, of $3. Suppose that two-year zero-coupon 
bonds issued by the company have a yield that is 1.5% greater than a similar risk-free 
zero-coupon bond. The value of the option is 


3070.015X2 — 9 94 


or $2.91. 
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Figure 20.1 Expected Exposure on a Matched Pair of Offsetting Interest Rate Swaps and a 
Matched Pair of Offsetting Currency Swaps 


20.7.2 Interest Rate Swaps vs. Currency Swaps 


Consider next the situation where the dealer has entered into a pair of swaps that offset 
each other with two different counterparties. Figure 20.1 compares the total expected 
future exposure on the two transactions when they are (a) currency swaps and (b) inter- 
est rate swaps. The expected exposure on the interest rate swaps starts at zero, increases, 
and then decreases. By contrast, expected exposure on the currency swaps increases 
steadily with the passage of time. The main reason for the difference is that principals 
are exchanged at the end of the life of a currency swap and there is uncertainty about 
the exchange rate at that time. By contrast, toward the end of the life of the interest 
rate swap, very little is still to be exchanged. The impact of default risk for a dealer in 
currency swaps is therefore much greater than for a dealer in interest rate swaps. The q; 
that the dealer calculates for counterparties in equation (20.1) are the same regardless of 
the transaction, but the v; are on average greater for currency swaps. 


20.7.3 Single Forward Transaction 


For another example, we suppose that a dealer has a single forward transaction with a 
counterparty giving the dealer the right to buy an asset from the counterparty at time 
T for a price K. No collateral has to be posted. Suppose that the forward price of the 
asset today is Fy (which is known) and at time t (t < T) it is F, (which is unknown). As 
explained in Appendix C, the value of the transaction at time t is 


(F, = Kje T9 


where r is the risk-free interest rate (assumed constant). 
The exposure at time f is 


max[(F, — K)e™ T, 0] = e~? max[(F, — K), 0] (20.10) 
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The present value of the exposure is therefore e~”? times the value of a derivative that 
pays off max[(F, — K), 0] at time t. This is an option on a forward price. From Appendix 


E, the value of the derivative is 


e"[FoN(d,) — KN(d,)| 


where 
t= ln(F)/K) + 0°t/2 
oyt 
and 
ga ln(F)/K) — 0°t/2 


oy! 


and o is the volatility of the forward price of the asset. It follows from equation (20.10) 
that the present value of the exposure at time t is 


eT [FoN(d,) — KN(d,)] 


The exposure, v,, at a possible default time, t;, is therefore 


cae eT [FN (d) — KN(d; ;)] (20.11) 
where 
i= In(F)/K) + 07t,/2 
i é Vi 
and 


_ In(Fo/K) — 0°t;/2 


d,; 
2,1 N 


Example 20.3 


Suppose that a bank has entered into a forward contract to buy 1 million ounces of gold 
from a mining company in two years for $1,500 per ounce. The current forward price 
for the contract is $1,600 per ounce. The probability of the company defaulting during 
the first year is 2% and the probability of the company defaulting during the second year 
is 3%. Defaults are assumed to happen at the midpoints of the years. The risk-free rate 
is 5% per annum. The financial institution anticipates a 30% recovery in the event of a 
default. The volatility of the forward price of gold when the forward contract expires in 
two years is 20%. 
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In this case, from equation (20.11) 
v = & 11 600N(d, 1) — 1,500N(d>,1)] 


where 


In(1,600/1,500) + 0.27 x 0.5/2 


a, = = 05271 
0.2/0.5 
In(1,600/1,500) — 0.2? x 0.5/2 
ie a al RC 


0.2/0.5 


so that v, = 132.38. Also, from equation (20.11) 
by = e711, 600N(d; 2) — 1,500N (dy 2)] 


where 


In(1,600/1,500) + 0.2? x 1.5/2 
i= ee see 


0.21.5 


j, = In(1,600/1,500) — 0.2? x 1.5/2 
ne a_a 
0.24/1.5 


so that v, = 186.65. 
Other variables are: q} = 0.02, q) = 0.03, and R = 0.3 so that 


= 0.1410 


CVA = (1 — 0.3) x (0.02 x 132.38 + 0.03 x 186.65) = 5.77 


The no-default value of the forward contract is (1,600 — 1,500)e7?%0-05 = 90.48 or 
$90.48. The value after the possibility of defaults has been taken into account is therefore 


90.48 — 5.77 = 84.71 


or $84.71. 
This example can be extended to the situation where the mining company can 
default more frequently or to calculate DVA (see Problems 20.13 and 20.14). 


Summary 


A derivatives dealer’s credit value adjustment (CVA) for its bilaterally cleared derivatives 
with a counterparty is the present value of the expected future loss from the possibility 
of the counterparty defaulting. A derivatives dealer typically has systems in place for 
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calculating the credit value adjustments (CVAs) for all the counterparties with which 
it has bilaterally cleared derivatives. The systems must simulate the market variables 
underlying outstanding transactions with each counterparty so that the expected net 
exposure at future times conditional on a default can be estimated. The calculations 
take account of netting and collateral agreements. The simplest assumption is that the 
probability of default is independent of the exposure. Procedures can be developed to 
incorporate wrong-way risk (where there is positive dependence between the two) and 
right-way risk (where there is negative dependence between the two). 

CVAs are complex derivatives, and many dealers hedge them in the same way that 
they hedge other derivatives. There are two types of risks associated with CVAs: the risk 
that the counterparty’s credit spread will change and the risk that there will be movements 
in the values of the underlying market variables. Once a CVA has been calculated, it is 
relatively easy to quantify the first risk, and the original Basel HI regulations require 
dealers to keep market risk capital for this risk. Calculating the sensitivity of CVA to 
movements in the underlying market variables is more difficult, but regulatory capital 
has been proposed for this risk as well. 

A dealer’s debit value adjustment (DVA) for its bilaterally cleared derivatives with a 
counterparty is the present value of the expected future loss to the counterparty from 
the possibility of the dealer defaulting. This is a gain to the dealer. It can be calculated at 
the same time as CVA. 
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Practice Questions and Problems 
(Answers at End of Book) 


20.1 Explain why a new transaction by a bank with a counterparty can have the effect 
of increasing or reducing the bank’s credit exposure to the counterparty. 
20.2 A company offers to post its own equity as collateral. How would you respond? 
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20.3 


20.4 


20.5 


20.6 
20.7 


20.8 
20.9 


20.10 


20.11 
20.12 


20.13 


20.14 


Suppose that a financial institution has two derivatives transactions outstanding 

with different counterparties, X and Y. Which of the following is/are true? 

(a) The total expected exposure in one year on the two transactions is the sum of 
the expected exposure on the transaction with X and the expected exposure 
on the transaction with Y. 


(b 


The total present value of the cost of defaults is the sum of the present value 


= 


of the cost of defaults on the transaction with X plus the present value of the 
cost of defaults on the transaction with Y. 
The 95th percentile for the total exposure in one year on both transactions is 


— 
lo) 
Pe 


the sum of the 95th percentile for the exposure in one year on the transaction 
with X and the 95th percentile for the exposure in one year on the transaction 
with Y. 
Explain your answers. 
“In the absence of collateral and other transactions between the parties, a long 
forward contract subject to credit risk is a combination of a short position in a 
no-default put and a long position in a call subject to credit risk’ Explain this 
statement. 
Suppose that the spread between the yield on a three-year riskless zero-coupon 
bond and a three-year zero-coupon bond issued by a corporation is 120 basis 
points. By how much do standard option pricing models such as Black—Scholes— 
Merton overstate the value of a three-year option sold by the corporation? Assume 
there is only this one transaction between the corporation and its counterparty 
and no collateral is posted. 
Can the existence of default triggers increase default risk? Explain your answer. 
Give two examples of when (a) wrong-way risk and (b) right-way risk can be 
expected to be observed. 
Explain the term “cure period.” 
“Netting means that CVA cannot be calculated on a transaction-by-transaction 
basis.” Explain this statement. 
“DVA can improve the bottom line when a bank is experiencing financial diffi- 
culties.’ Explain why this is so. 
What part of CVA risk is considered a component of market risk in Basel III? 
A CSA between a dealer and one of its counterparties states that collateral has to 
be posted by both sides with zero thresholds. If the cure period is assumed to be 
15 days, under what circumstances will the dealers CVA model lead to losses? 


Further Questions 


Extend Example 20.3 to calculate CVA when default can happen in the middle of 
each month. Assume that the default probability during the first year is 0.001667 
per month and the default probability during the second year is 0.0025 per month. 
Calculate DVA for the bank in Example 20.3. Assume that the bank can default 
in the middle of each month and that the default probability is 0.001 per month 
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20.15 


20.16 


for the two years. Assume that the recovery rate for the counterparty when the 

bank defaults is 40%. 

Consider a European call option on a non-dividend-paying stock where the stock 

price is $52, the strike price is $50, the risk-free rate is 5%, the volatility is 30%, 

and the time to maturity is one year. Answer the following questions assuming 

no recovery in the event of default, that the probability of default is independent 
of the option valuation, no collateral is posted, and no other transactions between 
the parties are outstanding. 

(a) What is the value of the option, assuming no possibility of a default? 

(b) What is the value of the option to the buyer if there is a 2% chance that the 
option seller will default at maturity? 

(c) Suppose that, instead of paying the option price up front, the option buyer 
agrees to pay the option price (with accumulated interest) at the end of the 
option’s life. By how much does this reduce the cost of defaults to the option 
buyer in the case where there is a 2% chance of the option seller defaulting? 

(d) If in case (c) the option buyer has a 1% chance of defaulting at the end of 
the life of the option, what is the default risk to the option seller? Discuss the 
two-sided nature of default risk in the case and the value of the option to each 
side. 

Suppose that the spread between the yield on a three-year riskless zero-coupon 

bond and a three-year zero-coupon bond issued by a bank is 210 basis points. 

The Black—Scholes—Merton price of an option is $4.10. How much should you 

be prepared to pay for it if you buy it from a bank? 


Chapter 21 


Credit Value at Risk 


T he value at risk (VaR) measure is central to the determination of regulatory 
capital for credit risk and to much of the credit risk management carried out 
internally by both financial and nonfinancial corporations. This chapter covers 
alternative approaches for calculating credit risk VaR. 

Credit risk VaR is defined similarly to market risk VaR. It is the credit risk loss over 
a certain time period that will not be exceeded with a certain confidence level. Some 
credit risk VaR models consider only losses from defaults; others consider losses from 
downgrades or credit spread changes as well as from defaults. 

Banks calculate credit risk VaR to determine both regulatory capital and economic 
capital. The regulatory capital requirements for credit risk are discussed in Chapters 15 
and 16. Economic capital, which is discussed in Chapter 26, is a financial institution’s 
own estimate of the capital it requires for the risks it is taking and is used to calculate 
return on capital measures for its business units. Sometimes the VaR model that a bank 
chooses to use to determine credit risk economic capital is different from the one it is 
required to use for the determination of regulatory capital. 

Value at risk is a scenario analysis measure. As explained in Chapter 7, the probability 
of default estimates used to develop alternative horizon-date scenarios should therefore 
be real-world estimates. Risk-neutral estimates should then be used to value the underly- 
ing portfolio on the VaR horizon date. It will be recalled from Chapter 19 that real-world 
default probabilities are lower than risk-neutral default probabilities. 

The time horizon for credit risk VaR is often longer than that for market risk VaR. 
Market risk VaR is often calculated with a one-day time horizon. Credit risk VaR, for 
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instruments that are not held for trading, is usually calculated with a one-year time hori- 
zon. Historical simulation (see Chapter 13) is the main tool used to calculate market risk 
VaR, but a more elaborate model is usually necessary to calculate credit risk VaR. 

A key aspect of any credit risk VaR model is credit correlation. Defaults for different 
companies do not happen independently of each other. During an economic down- 
turn, most companies are adversely affected and become more likely to default. When 
the economy is faring well, they are favorably affected and less likely to default. This 
relationship between default rates and economic factors is a major reason for credit cor- 
relation. If credit correlation increases (as it tends to do in stressed economic conditions), 
the risk for a financial institution with a portfolio of credit exposures increases. 


21.1 Ratings Transition Matrices 


The methods used by financial institutions for calculating credit VaR often involve ratings 
transition matrices. These are matrices showing the probability of a company migrating 
from one rating category to another during a certain period of time. They are based on 
historical data. The rating categories can be either those used internally by the finan- 
cial institution or those produced by rating agencies such as Moody’s, S&P, and Fitch. 
Table 21.1 shows the one-year transition matrix produced by Moody’, based on follow- 
ing the performance of all the companies that Moody’s rated between 1970 and 2016. 
For example, it shows that a company that starts with a rating of A has a 90.90% prob- 
ability of still being rated A at the end of a year. There is a 2.64% chance that it will be 
upgraded to Aa by the end of the year, a 5.67% chance that it will be downgraded to 
Baa, and so on. The probability that it will default during the year is only 0.06%, or 6 
chances in 10,000. 

If we assume that the rating change in one period is independent of that in another 
period, Table 21.1 can be used to calculate a transition matrix for periods other than one 


Table 21.1 One-Year Ratings Transition Matrix, 1970—2016, with Probabilities Expressed as 
Percentages and Adjustments for Transition to the WR (without rating) Category 


Rating at Year-End 


Initial 

Rating Aaa Aa A Baa Ba B Caa Ca-C Default 
Aaa 90.94 8.36 0.59 0.08 0.02 0.00 0.00 0.00 0.00 
Aa 0.87 89.68 8.84 0.45 0.07 0.04 0.02 0.00 0.02 
A 0.06 2.64 90.90 5.67 0.51 0.12 0.04 0.01 0.06 
Baa 0.04 0.16 4.44 90.16 4.09 0.75 0.17 0.02 0.18 
Ba 0.01 0.05 0.47 6.66 83.03 7.90 0.78 0.12 0.99 
B 0.01 0.03 0.16 0.51 5.32 82.18 7.39 0.61 3.79 
Caa 0.00 0.01 0.03 0.11 0.46 7.82 78.52 3.30 9.75 
Ca-C 0.00 0.00 0.07 0.00 0.80 3.19 11.41 51.28 33.24 


Default 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 


Source: Moody’s. 
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Table 21.2 Five-Year Ratings Transition Matrix Calculated from Table 21.1 with Probabilities 
Expressed as Percentages 


Rating at Year-End 


Initial 

Rating Aaa Aa A Baa Ba B Caa Ca-C Default 
Aaa 62.75 28.14 7.58 1.16 0.20 0.07 0.02 0.00 0.03 
Aa 2.96 60.27 29.98 5.27 0.82 0.33 0.13 0.01 0.20 
A 0.40 9.00 65.74 19.73 3.19 1.08 0.32 0.04 0.55 
Baa 0.18 1.45 15.45 63.40 12.25 4.33 1.18 0.13 1.68 
Ba 0.06 0.39 3.57 19.72 43.71 19.78 5.10 0.55 7.16 
B 0.04 0.16 0.94 3.83 13.22 43.21 16.56 1.65 20.39 
Caa 0.01 0.06 0.26 0.94 3.46 17.61 34.25 3.53 39.88 
Ca-C 0.00 0.03 0.21 0.51 2.09 7.30 12.50 4.53 72.82 


Default 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 


year. For example, a transition matrix for two years can be calculated by multiplying the 
matrix by itself. The five-year transition matrix, which is shown in Table 21.2, is calcu- 
lated as the fifth power of the matrix in Table 21.1. Table 21.2 shows, not surprisingly, 
that the probability of a company keeping the same credit rating over five years is much 
lower than it is over one year, and default probabilities over five years are much higher 
than over one year. 

The credit rating change over a period less than a year is not quite so easy to calculate. 
For example, estimating a transition matrix for six months involves taking the square root 
of the matrix in Table 21.1; estimating the transition matrix for three months involves 
taking the fourth root of the matrix; and so on. The calculation methodology is explained 
in Appendix J at the end of the book and software for performing the calculations is on 
the author’s website: www-2.rotman.utoronto.ca/~hull/riskman. 

Table 21.3 shows the rating transition calculated for a period of one month from the 
data in Table 21.1. As might be expected, the probability that a company’s credit rating 
will stay the same over a period of one month is very high. 


Table 21.3 One-Month Ratings Transition Matrix Calculated from Table 21.1 with Probabilities 
Expressed as Percentages 


Rating at Year-End 


Initial 

Rating Aaa Aa A Baa Ba B Caa Ca-C Default 
Aaa 99.21 0.77 0.02 0.01 0.00 0.00 0.00 0.00 0.00 
Aa 0.08 99.08 0.81 0.02 0.00 0.00 0.00 0.00 0.00 
A 0.00 0.24 99.19 0.52 0.04 0.01 0.00 0.00 0.00 
Baa 0.00 0.01 0.40 99.12 0.39 0.05 0.01 0.00 0.01 
Ba 0.00 0.00 0.03 0.63 98.42 0.78 0.04 0.01 0.07 
B 0.00 0.00 0.01 0.03 0.53 98.32 0.75 0.06 0.30 
Caa 0.00 0.00 0.00 0.01 0.02 0.79 97.94 0.41 0.82 
Ca-C 0.00 0.00 0.01 0.00 0.08 0.32 1.42 94.55 3.62 


Default 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 100.00 


482 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


The assumption that the credit rating change in one period is independent of that in 
another period is not exactly true. Ifa company has been downgraded recently, it is more 
likely to be downgraded again in the next short period of time.' (This phenomenon is 
sometimes referred to as ratings momentum.) However, the independence assumption is 
not too unreasonable for many purposes” 


21.2 Vasicek’s Model 


The Basel II internal-ratings-based (IRB) capital requirements for credit risk in the bank- 
ing book are based on Vasicek’s Gaussian copula model (see Sections 11.6 and 15.8). This 
is a way of calculating high percentiles of the distribution of the default rate for a port- 
folio of loans. As in Chapter 11, we define WCDR (T, X) as the Xth percentile of the 
default rate distribution during a period of length T. (It will be recalled that WCDR 
denotes worst case default rate.) Vasicek’s model relates WCDR(T, X) to the probability 
of default in time T, PD, and a parameter, p, describing credit correlation. The formula, 
which is proved in Section 11.6, is 


N! (PD N-'!(X 
WCDR(T, X) = N AED F VATA (21.1) 


Vi-p 
For an individual loan, if EAD is the exposure at default and LGD is the loss given default, 
the Xth percentile of the loss distribution is 


WCDR(T, X) x EAD x LGD 


A result by Gordy (2003) enables us to extend this in a one-factor world. If we have 
a large portfolio of n loans where each loan is a small part of the total portfolio, the Xth 
percentile of the loss distribution is approximately 
$, WCDR,(T, X) x EAD; x LGD; (21.2) 
i=1 


where WCDR,(T, X), EAD,, and LGD; are the values of WCDR, EAD, and LGD for 
the ith loan in the portfolio. 


'For a discussion of this, see E. Altman and D. Kao, “The Implications of Corporate Bond Rating 
Drift,’ Financial Analysts Journal (May-June 1992): 64-75, and D. Lando and T. Skodeberg, “Analyzing 
Rating Transitions and Rating Drift with Continuous Observations,” Journal of Banking and Finance 
26 (2002): 423-444. 

? When the five-year transition matrix in Table 21.2 is compared with the actual five-year transition 
matrix published by Moody’, there are some differences. For example, companies in the top three 
rating categories are more likely to default or suffer big downgrades. This is consistent with ratings 
momentum. 

3 See M. Gordy, “A Risk-Factor Model Foundation for Ratings-Based Capital Rules,” Journal of Finan- 
cial Intermediation 12, no. 3 (July 2003): 199-233. 
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As explained in Chapter 15, regulatory capital for the banking book is set equal to the 
expression in equation (21.2) with T equal to one year and X equal to 99.9%. Sometimes 
the expression under the summation sign in equation (21.2) is multiplied by a maturity 
adjustment factor MA,, to allow for the fact that, if it lasts longer than one year, the 
ith loan might deteriorate without defaulting. In the foundation IRB approach, banks 
estimate PDs, but EADs, LGDs, and MAs are set by the Basel II rules. In the advanced 
IRB approach, banks estimate PDs, EADs, LGDs, and MAs. But in all cases the parameter 
p is set by Basel II rules. 

When Vasicek’s model is used to determine economic capital, banks are free to make 
their own estimates of p. Structural models of the type discussed in Section 19.8 can 
be used to show that p for two companies should be roughly equal to the correlation 
between the returns on the assets of the companies.’ As an approximation, this is the 
same as the correlation between the returns on their equities. One way of determining p 
for a portfolio of exposures to companies is therefore to calculate the average correlation 
between the returns on equities for the companies. If the companies are not publicly 
traded, the average correlation between representative publicly traded companies can 
be used instead. Another approach is the maximum likelihood approach described in 
Section 11.6.2. 

As pointed out in Chapter 11, Vasicek’s model has the disadvantage that it incor- 
porates very little tail correlation. Replacing the Gaussian copula model with another 
copula model (see Section 11.6.3 for some alternatives) can remedy this. 


21.3 Credit Risk Plus 


In 1997, Credit Suisse Financial Products proposed a methodology for calculating VaR 
that it termed Credit Risk Plus. It involves analytic approximations that are well estab- 
lished in the insurance industry. 

Suppose that a financial institution has n loans of a certain type and the probability 
of default for each loan during a year is q so that the expected number of defaults is qn. 
Assuming that default events are independent, the probability of m defaults is given by 
the binomial distribution as 


n! mq — oe 
m\(n — my! 1 


If q is small and n large, this can be approximated by the Poisson distribution: 


e 4 (qn) m 


Prob(m defaults) = 
m! 


(21.3) 


4See for example, J. Hull, M. Predescu, and A. White, “The Valuation of Correlation Dependent 
Derivatives Using a Structural Model,” Journal of Credit Risk 6, no. 3 (Fall 2010): 99-132. 
>See Credit Suisse Financial Products, “Credit Risk Management Framework” (October 1997). 
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This equation is approximately true even if the probability of default is not the same for 
each loan, provided that all the probabilities of default are small and q equals the average 
probability of default over the next year for the loans in the portfolio. 

In practice, we are uncertain about what the default rate, q, will be during the next 
year. As shown in Table 11.6, default rates vary greatly from year to year. A convenient 
assumption is that qn, the expected number of defaults, has a gamma distribution with 
mean p and standard deviation ©. 

The Poisson distribution in equation (21.3) then becomes a negative binomial dis- 
tribution 


p Ln a [(m+oa) 
rob(m defaults) = p” (1 — p) Em Dro (21.4) 


where a = p?/67, p = 0° / (p + 0°), and T(x) is the gamma function £ 

Table 21.4 shows the probability distribution for the number of defaults when p = 4 
for various values of o. The first column (o = 0) is calculated from equation (21.3). The 
remaining columns are calculated from equation (21.4). It can be seen that as © tends to 
zero, the negative binomial distribution in equation (21.4) tends to the same probability 
distribution for the number of defaults as the Poisson distribution. As © increases, the 
probability of an extreme outcome involving a large number of defaults increases. Sup- 
pose that the loss from a default is constant. As indicated in the final row of the table, 
when o = 0 so that there is no uncertainty about the default rate, the 99.9% VaR is cal- 
culated as the loss when there are 11 defaults. When o = 10 so that there is a great deal 
of uncertainty about the default rate, the 99.9% VaR is considerably larger. It is the loss 
when there are 98 defaults. 

A more flexible way of building uncertainty into the default rate is by using Monte 
Carlo simulation. Typically there are several different categories of exposures with dif- 
ferent but correlated default rates. One way of proceeding is as follows: 


1. Sample an overall default rate. This could be a random sample from data such as that 
in Table 11.6. 

2. Develop a model relating the default rate for each category of obligors to the overall 
default rate. A simple model is obtained by regressing default rates for each category 
of obligors against the overall default rate. 

. Sample a number of defaults for each category of obligors. 

. Sample a loss given default for each default in each category. 

. Calculate the total loss from the defaults. 

. Repeat steps 1 to 5 many times to build up a probability distribution for the total 


AvA Ww 


loss. 
7. Calculate the required VaR from the total loss probability distribution. 


é Tn Excel, the GAMMALN function returns the natural logarithm of the gamma function, (x). When 
x is an integer (x) = (x — 1)! 
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Table 21.4 Probability Distribution for Number of Defaults When the Expected Number of 
Defaults Is 4.0. 


Standard Deviation of Default Rate 
Number of 


Defaults 0 0.1 0.5 1.0 2.0 5.0 10.0 


0.0183 0.0184 0.0207 0.0281 0.0625 0.2814 0.5938 
0.0733 0.0734 0.0777 0.0901 0.1250 0.1553 0.0913 
0.1465 0.1466 0.1486 0.1531 0.1563 0.1098 0.0509 
0.1954 0.1952 0.1923 0.1837 0.1563 0.0833 0.0353 
0.1954 0.1951 0.1895 0.1746 0.1367 0.0653 0.0268 
0.1563 0.1561 0.1516 0.1396 0.1094 0.0523 0.0214 
0.1042 0.1041 0.1026 0.0978 0.0820 0.0423 0.0177 
0.0595 0.0596 0.0603 0.0614 0.0586 0.0346 0.0150 
0.0298 0.0298 0.0315 0.0353 0.0403 0.0285 0.0129 
0.0132 0.0133 0.0148 0.0188 0.0269 0.0236 0.0113 
0.0053 0.0053 0.0064 0.0094 0.0175 0.0196 0.0099 
0.0019 0.0019 0.0025 0.0045 0.0111 0.0163 0.0088 
0.0006 0.0007 0.0009 0.0020 0.0069 0.0137 0.0079 
0.0002 0.0002 0.0003 0.0009 0.0043 0.0115 0.0071 
0.0001 0.0001 0.0001 0.0004 0.0026 0.0096 0.0064 
0.0000 0.0000 0.0000 0.0001 0.0016 0.0081 0.0058 
0.0000 0.0000 0.0000 0.0001 0.0009 0.0068 0.0053 
0.0000 0.0000 0.0000 0.0000 0.0005 0.0058 0.0048 
0.0000 0.0000 0.0000 0.0000 0.0003 0.0049 0.0044 
0.0000 0.0000 0.0000 0.0000 0.0002 0.0041 0.0041 


PRP PP PP RP RPP 
OMmANDUFWONF TO AN AU FON FY © 


20 0.0000 0.0000 0.0000 0.0000 0.0001 0.0035 0.0038 
99.9 percentile 11 11 12 13 17 39 98 
of number of 

defaults 


There are a number of ways in which the Monte Carlo simulation can be refined. An 
analysis of the data in Table 11.6 shows that the default rate in one year is not independent 
of the default rate in the previous year. This indicates that randomly sampling a default 
rate from a table such as Table 11.6 to determine next year’s default rate may not be the 
best approach. It may be preferable to develop a model that relates the default rate in one 
year to the default rate in the previous year or other economic factors observable in the 
previous year. 

Note that the uncertainty about the default rate plays a key role in the analysis. With- 
out this uncertainty, there is no default correlation and, as indicated in Table 21.4, there 
is very little chance of a large number of defaults. As the uncertainty about the default 
rate increases, default correlation increases and a large number of defaults becomes more 
likely. The default correlation arises because all companies have the same default rate, 
which can be high or low. Without default correlation, the loss probability distribution 
is fairly symmetrical. With default correlation, it is positively skewed, as indicated in 
Figure 21.1. 
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0 Loss 


Figure 21.1 General Shape of Probability Distribution of Default Losses 


21.4 Creditmetrics 


Vasicek’s model and Credit Risk Plus estimate the probability distribution of losses aris- 
ing from defaults. The impact of downgrades is not considered” We now consider the 
CreditMetrics model, which is designed so that it can take account of downgrades as 
well as defaults. This model was proposed by JPMorgan in 1997. It is based on a rating 
transition matrix such as the one shown in Table 21.1. The ratings can be the internal 
ratings used by the bank (derived from its own historical data) or those produced by 
rating agencies. 

Calculating a one-year credit VaR for a portfolio of transactions with many coun- 
terparties involves carrying out a Monte Carlo simulation. On each simulation trial, the 
credit ratings of all counterparties at the end of one year are determined. (The procedure 
for doing this will be explained shortly.) The credit loss for each counterparty is then 
calculated. If the end-of-year credit rating for a counterparty is not default, the calcula- 
tion of the credit loss is given by valuing all the transactions with the counterparty at the 
one-year point. If the end-of-year credit rating is default, the credit loss is the exposure 
at default times one minus the recovery rates 

The calculations require the term structure of credit spreads for each rating category. 
A simple assumption is that it is the same as that observed in the market today. An 
alternative assumption is that there is a credit spread index that has a certain probability 
distribution and that all credit spreads are linearly dependent on the index. 


7 However, the maturity adjustment factor does allow for downgrades in the regulatory implementa- 
tion of Vasicek’s model. 
8 The recovery rate is typically sampled from an empirical distribution. 
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When transactions with a counterparty involve derivatives, an extension of the CVA 
calculations described in Chapter 20 is necessary. Recall that, for each counterparty, a 
CVA calculation involves dividing the time between today and the end of the life of the 
longest transaction with the counterparty into n intervals. We defined q; as the probability 
of default during the ith time interval, v; is the present value of the expected net exposure 
at the midpoint of the interval after collateral has been taken into account, and R is the 
recovery rate. The expected cost of defaults from one year onward, as seen at time zero, is 


n 


= Raw, 


i=j 


where the jth interval starts at time one year. 

On each CreditMetrics simulation trial where there is no default during the first 
year, the term structure of credit spreads at the one-year point gives the probability of 
default in each interval from year one onward using equation (20.3). Suppose that qj is 
the probability of default for the ith interval (i > j) on a particular simulation trial. The 
credit loss for that trial is 


n 


EA- RE = av, (21.5) 


i=j 


Note that, if the counterparty’s credit rating improves during the year, this credit loss is 
likely to be negative. 

If there is a default during the first year on a particular simulation trial, the timing of 
the default is sampled and the exposure at default determined. This is multiplied by one 
minus the recovery rate to determine the loss. 

The results of the CreditMetrics Monte Carlo simulation determine a probability 
distribution for total credit losses arising from defaults and downgrades across all coun- 
terparties. The required value at risk can be calculated from this distribution. 


21.4.1 The Correlation Model 


In sampling to determine credit losses, the credit rating changes for different counter- 
parties are not assumed to be independent. A Gaussian copula model is used to construct 
a joint probability distribution of rating changes. (See Section 11.5 for a discussion of 
copula models.) The copula correlation between the rating transitions for two companies 
is typically set equal to the correlation between their equity returns. (The capital asset 
pricing model discussed in Chapter 1 provides a one-factor model for equity returns. 
As described in Section 11.4.2, this can be used to determine an internally consistent 
correlation matrix.) 

Suppose that we are simulating the rating change of an A-rated and a B-rated com- 
pany over a one-year period using the transition matrix in Table 21.1. Suppose that the 
correlation between the equity returns of the two companies is 0.2. On each simula- 
tion trial we would sample two variables x, and x, from standard normal distributions 
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so that their correlation is 0.2. The variable x, determines the new rating of the A- 
rated company and variable x, determines the new rating of the B-rated company. From 
Table 21.1, the probability of an A-rated company moving to Aaa, Aa, A,... is 0.0006, 
0.0264, 0.9090, and so on. Because 


N7'(0.0006) = —3.2389 
N7!(0.0006 + 0.0264) = —1.9268 
N7!(0.0006 + 0.0256 + 0.9090) = 1.5220 


the A-rated company gets upgraded to Aaa if x4 < —3.2389, it becomes Aa-rated 
if —3.2389 < x, < —1.9268, it stays A-rated if —1.9268 < x, < 1.5220, and so on. 
Table 21.1 also shows that the probability of a B-rated company moving to Aaa, Aa, 
A,... is 0.0001, 0.0003, 0.0016, and so on. Because 


N7!(0.0001) = —3.7190 
N~!(0.0001 + 0.0003) = —3.3528 
N7!(0.0001 + 0.0003 + 0.0016) = —2.8782 


the B-rated becomes Aaa-rated if xg < —3.7190, it becomes Aa-rated if —3.7190 < xg < 
—3.3528, it becomes A-rated if —3.3528 < xp < —2.8782, and so on. The A-rated com- 
pany defaults if x, > N~'(0.9994), that is, when x4 > 3.2389. The B-rated company 
defaults when xg > N7~'(0.9621), that is, when x, > 1.7756. This example is illustrated 
in Figure 21.2. 

It is interesting to note that, if both the CreditMetrics and the Credit Risk Plus 
models were based on the same set of assumptions, they should in theory predict the 
same probability distribution for losses over the long term. It is the timing of losses that 
is different. Suppose, for example, that you hold a certain loan in your portfolio. In year 1, 
the borrower gets downgraded from A to BBB; in year 2, it gets downgraded from BBB 
to B; in year 3 it defaults. You could assume that there are no losses in years 1 and 2 and 
calculate loss in year 3 (the Credit Risk Plus approach). Alternatively, you can calculate 
separate revaluation losses in years 1, 2, and 3 (the CreditMetrics approach). The losses 
under the second approach should in theory add up to the losses under the first approach. 


21.5 Credit Spread Risk 


The values of most credit-sensitive products in the trading book depend critically on 
credit spreads. Calculating VaR or ES for portfolios including these products therefore 
involves examining potential credit spread changes. One possibility is to use historical 
simulation to calculate a one-day 99% VaR. This one-day 99% VaR can then be scaled 
up by v10 to obtain a 10-day 99% VaR. It is first necessary to collect credit spread 
changes for each company to which the bank is exposed over the last 500 days (or some 
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X4< 3.2389 Kan xp< -3.7190 Kai 
—3.2389 < x, < -1.9268 —3.7190 < xp < -3.3528 
>» Aa a 
—1.9268 < x, < 1.5220 —3.3528 < xp < —2.8782 
> A > 
1.5220 < x4 < 2.4422 —2.8782 < xp < -2.4522 
>» Baa >» Baa 
A B > 
2.4422 < x, < 2.8480 —2.4522< xp < -1.5523 
—________» Ba >» Ba 
2.8480 < x, < 3.0902 —1.5523 < x< 1.1856 
> B B 
3.0902< x4 < 3.2389 Caa 1.1856 < xg< 1.7060 Caa 
3.2389<x, <3.2905 1.7060<xg <1.7756 
Ca-C CHC 
x, > 3.2905 Xp> 1.7756 
> Default > Default 


Figure 21.2 The CreditMetrics Correlation Model: Transition of A-Rated and B-Rated 
Companies to a New Rating after One Year (x4 and xp are sampled from standard normal 
distributions with the correlation equal to the correlation between the equity returns of A and B) 


other period). The first historical simulation scenario assumes that the percentage credit 
spread changes for all companies are the same as they were on the first day; the second 
scenario assumes that they are the same as they were on the second day; and so on. 
However, there are problems with this approach. If the company is alive today, it did not 
default in the past and the calculations carried out therefore assume no probability of 
default in the future.” Another problem is that credit spreads are not updated daily for all 
companies and so the historical data may not be of high quality. 

An alternative approach is to use a version of the CreditMetrics approach. A 10-day 
rating transitions matrix is calculated as indicated in Section 21.1. The ratings transition 
matrix defines the probability of a company moving from one rating category to another 
during a 10-day period. The historical data on credit rating changes define a probability 
distribution for the credit spread changes associated with each rating category at the end 
of 10 days. A Monte Carlo simulation is carried out. On each trial, the transition matrix 
is sampled from to determine whether each company stays in the same rating category, 
changes to a different rating category, or defaults. The probability distribution of credit 
spread changes is also sampled from to determine the credit spread at the end of the 


? One way of dealing with this is to assume that default takes place when the credit spread exceeds a 
certain level. Another is to separately consider “jump-to-default” risk, which is the approach taken by 
regulators. 
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10 days associated with each rating category. This enables the value of the portfolio at 
the end of 10 days to be calculated on each trial and a VaR to be obtained. 

Credit correlation can be introduced in two ways. First, a Gaussian copula model 
can be used to correlate the credit rating changes of different companies, as explained 
in Section 21.4.1. Second, the credit spread changes for different rating categories can 
be assumed to be perfectly (or nearly perfectly) correlated so that when the spreads for 
A-rated instruments move up, those for instruments with other ratings do as well. 


Example 21.1 


Consider a company that owns a single two-year zero-coupon bond with a principal of 
$1,000. Suppose that the risk-free rate is 3% and the current credit spread is 200 basis 
points so that the bond yield is 5%. (Rates are expressed with annual compounding.) The 
current price of the bond is 1,000/1.057 = $907.03. Suppose that the bond’s current 
rating is Ba and, approximating the numbers in Table 21.3, that during the next month 
there is a 0.6% chance that it will increase to Baa, a 98.5% chance that it will stay the 
same, a 0.8% chance that it will decrease to B, and a 0.1% chance that it will default. If it 
defaults, the bond will be worth $400. For each possible rating category there are three 
equally likely credit spreads. In basis points these are 80, 100, and 120 for Baa; 160, 200, 
and 240 for Ba; and 400, 450, and 500 for B. 

The worst outcome is clearly a default. This has a probability of 0.1% and will lead 
to a loss of 907.03 — 400 = $507.03. The next worst outcome involves a transition to a 
rating of B when the credit spread for that rating category is 500 basis points. This has a 
probability of 0.8/3 = 0.267%. Because the bond will have about 1.917 years to maturity, 
its price will be 1,000/1.08!?'” = $862.85, for a loss of 907.03 — 862.85 = $44.17. A 
complete list of outcomes with their probabilities is shown in Table 21.5. 


Table 21.5 Outcomes for Example 21.1 
Rating Spread (bp) Probability Bond Value ($) Loss ($) 


Default 0.100% 400.00 507.03 
B 500 0.267% 862.85 44.17 
B 450 0.267% 870.56 36.47 
B 400 0.267% 878.38 28.65 
Ba 240 32.833% 904.11 2.92 
Ba 200 32.833% 910.72 —3.70 
Ba 160 32.833% 917.41 —10.38 
Baa 120 0.200% 924.17 —17.14 
Baa 100 0.200% 927.58 —20.55 
Baa 80 0.200% 931.01 —23.98 


The table shows that, if the confidence level is higher than 99.9%, the VaR is $507.03; 
if it is between 99.9% and 99.633%, the VaR is $44.17; and so on. When the confidence 
level is 99%, the VaR is a modest $2.92. 
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21.5.1 Constant Level of Risk Assumptions 


When calculating VaR or ES, banks sometimes make a constant level of risk assumption. 
This means that they assume a portfolio is rebalanced periodically to bring the risk back 
to where it was initially. 

Suppose, for example, that a company invests in Baa bonds and is considering two 
alternative trading strategies: 


1. Hold bonds for one year and then sell (the “buy and hold” strategy). 
2. Rebalance at the end of each month so that if bonds are no longer Baa they are sold 
and replaced with Baa bonds (the “constant level of risk” strategy). 


The first strategy leads to outcomes given by the Baa row of Table 21.1, while, in 
the case of the second strategy, the outcomes given by the Baa row of Table 21.3 are 
repeated 12 times. 

Consider first defaults. From Table 21.1, the probability of a default under the buy- 
and-hold strategy is 0.0018. From Table 21.3, the probability of a default each month is 
0.0001215.!° The probability of one default when the constant level of risk strategy is 
used is therefore 


12 x 0.0001215 x 0.9998785!! = 0.001456 


This is 19% less than that under the buy-and-hold strategy. The probability of two or 
more defaults is negligible. 

Consider next the probability of a downgrade to C. Under the buy-and-hold strategy, 
the probability of this is 0.0017 from Table 21.1. When the constant level of risk strategy 
is used, the probability of one downgrade to C is 


12 x 0.0001277 x 0.9998723!! = 0.001530 


which is a 10% reduction over the corresponding number for the buy-and-hold strategy. 
The probability of more than one downgrade to C is negligible. 

Finally, consider the probability of a downgrade to Ba. Under the buy-and-hold 
strategy, the probability of this is 0.0409 from Table 21.1. When the constant level of risk 
strategy is used, the probability of one downgrade to Ba is 


12 x 0.0038735 x 0.9961265!! = 0.045505 


which is a 10% increase over the buy-and-hold strategy. The probability of more than 
one downgrade in this case is about 0.001. 


10 Note that these results are based on numbers with more decimal places than those in Table 21.3. 
See results on the author’s website. 
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The calculations illustrate that the buy-and-hold strategy will lead to greater losses 
from defaults and big downgrades than the constant level of risk strategy. But the prob- 
abilities of losses from a small rating change are less. VaR and ES are generally smaller 
when the constant level of risk strategy is used. 


Summary 


Credit VaR is defined similarly to the way VaR is defined for market risk. It is the credit 
loss that will not be exceeded over some time horizon with a specified confidence level. 
Basel regulations require credit VaR to be calculated with a one-year time horizon and a 
99.9% confidence level. For economic capital purposes a different confidence level may 
be used. 

We have discussed three methods for calculating credit VaR: Vasicek’s model, Credit 
Risk Plus, and CreditMetrics. Vasicek’s model is based on a one-factor Gaussian copula 
model of the time to default for the companies in a portfolio. It is used in the determina- 
tion of regulatory capital. Credit Risk Plus uses procedures that are similar to those used 
in the insurance industry to calculate default losses from assumptions about the proba- 
bilities of default of individual companies. CreditMetrics is different from the other two 
methods in that it considers losses from both defaults and downgrades. It uses a Gaussian 
copula model in conjunction with a ratings transition matrix. 

To determine VaR or ES for instruments dependent on credit spreads, one approach 
is to collect historical data on the credit spread changes of companies and use a historical 
simulation methodology similar to that for market risk. Another is to model the rating 
transitions of companies and movements in the average credit spreads associated with 
different rating categories. 


Further Reading 
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Finger, C. C. “Creditmetrics and Constant Level of Rusk.’ Working Paper, RiskMetrics Group-MSCI, 
September 2010. 

Gordy, M. “A Raisk-Factor Model Foundation for Ratings-Based Capital Rules.” Journal of Financial 
Intermediation 12, no. 3 (July 2003): 199-233. 

JP Morgan. “CreditMetrics—Technical Document” (April 1997). 

Vasicek, O. “Probability of Loss on a Loan Portfolio.” Working Paper, KMV, 1987. (Published in Risk 
in December 2002 under the title “Loan Portfolio Value.’) 


Practice Questions and Problems 
(Answers at End of Book) 


21.1 Explain the difference between Vasicek’s model, the Credit Risk Plus model, and 
CreditMetrics as far as the following are concerned: (a) when a credit loss is rec- 
ognized and (b) the way in which default correlation is modeled. 
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21.2 
21.3 


21.4 


21.5 


21.6 


21.7 


21.8 


21.9 


21.10 


21.11 


Explain what is meant by the constant level of risk assumption. 

Use the transition matrix in Table 21.1 to calculate a transition matrix for two 
years. What is the probability of a company rated Aaa staying Aaa during the two 
years? What is the probability of it moving to Aa? 

Use the transition matrix in Table 21.1 and software on the author’s website to 
calculate a transition matrix for six months. What is the probability of a company 
rated Aaa staying Aaa during the six months? What is the probability of it moving 
to Aa? 

How can historical simulation be used to calculate a one-day 99% VaR for the 
credit risk of bonds in the trading book? What are the disadvantages? 

A bank has 100 one-year loans, each with a 1% probability of default. What is the 
probability of six or more defaults, assuming independence? 

Repeat Problem 21.6 on the assumption that the probability of default applicable 
to all loans is uncertain. It is equally likely to be 0.5% or 1.5%. 

What is the autocorrelation for the default rates in Table 11.6, which can also be 
found on a spreadsheet on the author’s website? What are the implications of this 
for a Credit Risk Plus model? 


Further Questions 


Explain carefully the distinction between real-world and risk-neutral default prob- 
abilities. Which is higher? A bank enters into a credit derivative where it agrees to 
pay $100 at the end of one year if a certain company’s credit rating falls from A to 
Baa or lower during the year. The one-year risk-free rate is 5%. Using Table 21.1, 
estimate a value for the derivative. What assumptions are you making? Do they 
tend to overstate or understate the value of the derivative? 

Suppose that a bank has a total of $10 million of small exposures of a certain 
type. The one-year probability of default is 1% and the recovery rate averages 
40%. Estimate the 99.5% one-year credit VaR using Vasicek’s model if the copula 
correlation parameter is 0.2. 

Use the transition matrix in Table 21.1 and software on the author’s website to 
calculate the transition matrix over 1.25 years. 


Part Five 


OTHER TOPICS 


Chapter 22 


Scenario Analysis and 
Stress Testing 


tress testing involves evaluating the impact of extreme, but plausible, scenarios that 
S are not considered by value at risk (VaR) or expected shortfall (ES) models. If there 

is one lesson to be learned from the market turmoil that started in the summer 
of 2007, it is that more emphasis should be placed on stress testing and less emphasis 
should be placed on the mechanistic application of VaR/ES models. VaR /ES models are 
useful, but they are inevitably backward looking. Risk management is concerned with 
what might happen in the future. 

This chapter considers the different approaches that can be used to generate scenarios 
for stress testing and how the results should be used. It explains that the financial crisis 
of 2007-2008 has caused regulators to require banks to conduct more stress testing and 
that increasingly regulators are defining the stress tests themselves in an attempt to ensure 
that banks have sufficient capital to withstand adverse scenarios. 


22.1 Generating the Scenarios 


The most popular approach for calculating market risk VaR or ES is the historical sim- 
ulation approach covered in Chapter 13. This approach assumes that data from the past 
are a good guide to what will happen over the next 1 to 10 days. But if an event has not 
occurred during the period covered by the data, it will not affect the results when the 
usual historical simulation method is used. 
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We have discussed a number of ways VaR/ES calculations can be modified so that 
they reflect more than the simple assumption that the future movements in market vari- 
ables will be a random sample from the recent past. In particular: 


1. Volatility scaling (see Section 13.3) can lead to more extreme outcomes being con- 
sidered when the market is highly volatile. 

2. Extreme value theory (see Section 13.5) provides a way of extending the tails of the 
loss distribution obtained from historical data. 

3. Calculating stressed VaR or stressed ES (see Section 13.1) considers the impact of a 
particularly difficult period of 250 days that has occurred in the past. 


But the nature of a VaR/ES calculation is that it is backward looking. Events that could 
happen, but are quite different from those that occurred during the period covered by 
the data, are not taken into account. Stress testing is an attempt to overcome this weakness 
of the VaR/ES measure. 

Stress testing involves estimating how the portfolio of a financial institution would 
perform under scenarios involving extreme (but plausible) market moves. The exchange 
rate example in Table 10.1 illustrates the occurrence of extreme market moves. It shows 
that five-standard-deviation moves happen about every five years on average, whereas 
they would happen hardly ever (about once every 7,000 years) if the distributions were 
normal. 

A key issue in stress testing is the way in which scenarios are chosen. We now consider 
alternative procedures. 


22.1.1 Stressing Individual Variables 


One approach is to use scenarios where there is a large move in one variable and 
other variables are unchanged. Examples of scenarios of this type that are sometimes 
considered are: 


1. A 100-basis-point parallel shift (up or down) in a yield curve. 

2. Increasing or decreasing all the implied volatilities used for an asset by 50% of current 
values. 

3. Increasing or decreasing an equity index by 10%. 

4. Increasing or decreasing the exchange rate for a major currency by 6%. 

5. Increasing or decreasing the exchange rate for a minor currency by 20%. 


The impact of small changes in a variable is measured by its delta, as explained in 
Chapter 8. The impact of larger changes can be measured by a combination of delta and 
gamma. Here we are considering changes that are so large that it is likely to be unreliable 
to estimate the change in the value of a portfolio using Greek letters. 
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22.1.2 Scenarios Involving Several Variables 


Usually, when one market variable shows a big change, others do as well. This has led 
financial institutions to develop scenarios where several variables change at the same time. 
A common practice is to use extreme movements in market variables that have occurred 
in the past. For example, to test the impact of an extreme movement in U.S. equity 
prices,a company might set the percentage changes in all market variables equal to those 
on October 19, 1987 (when the S&P 500 moved by 22.3 standard deviations). If this is 
considered to be too extreme, the company might choose January 8, 1988 (when the S&P 
500 moved by 6.8 standard deviations). Other dates when there were big movements in 
equity prices are September 11, 2001, when terrorists attacked the World Trade Center 
in New York, and September 15, 2008, when Lehman Brothers declared bankruptcy. 
To test the effect of extreme movements in UK interest rates, the company might set 
the percentage changes in all market variables equal to those on April 10, 1992 (when 
10-year bond yields moved by 8.7 standard deviations). 

Another approach is to magnify what has happened in the past to generate extreme 
scenarios. For example, we might take a period of time when there were moderately 
adverse market movements and create a scenario where all variables move by three or 
five times as much as they did then. The problem with this approach is that correla- 
tions increase in stressed market conditions, and increasing the movements in all market 
variables by a particular multiple does not increase correlation. 

Some historical scenarios are one-day shocks to market variables. Others, particularly 
those involving credit and liquidity, involve shocks that take place over several days, several 
weeks, or even several months. It is important to include volatilities in the market variables 
that are considered. Typically, extreme movements in market variables such as interest 
rates and exchange rates are accompanied by large increases in the volatilities of these 
variables and large increases in the volatilities of a wide range of other variables. Some 
scenarios are likely to involve big movements in commodity prices, such as the plunge in 
oil prices in the second half of 2014. Others may involve a situation where there is a 
flight to quality combined with a shortage of liquidity and an increase in credit spreads. 
This was what happened in August 1998 when Russia defaulted on its debt and in 
July and August 2007 when investors lost confidence in the products created from the 
securitization of subprime mortgages (see Chapter 6). 


22.1.3 Scenarios Generated by Management 


History never repeats itself exactly. This may be partly because traders are aware of past 
financial crises and try to avoid making the same mistakes as their predecessors. The U.S. 
mortgage market led to the credit crisis that started in 2007. It is unlikely that future 
credit crises will be a result of mortgage-lending criteria being relaxed—but it is likely 
that there will be credit crises in the future. 

In many ways, the scenarios that are most useful in stress testing are those generated 
by senior management or by the economics group within a financial institution. Senior 
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management and the economics group are in a good position to use their understanding 
of markets, world politics, the economic environment, and current global uncertainties 
to develop plausible scenarios that would lead to large losses. Sometimes, the scenarios 
produced are based on things that have happened in the past, but are adjusted to include 
key features of the current financial and economic environment. 

One way of developing the scenarios is for a committee of senior management 
to meet periodically and “brainstorm” answers to the simple question: “What can go 
wrong?” Clemens and Winkler (1999) have done studies to investigate the optimal com- 
position ofa committee of this type.! Their conclusions are that (a) the committee should 
have three to five members, (b) the backgrounds of the committee members should be 
different, and (c) there should be a healthy dialogue between members of the committee. 
It is important that members of the committee are able step back from their day-to-day 
responsibilities to consider the big picture. 

It is not always the case that senior management’s thinking has to be highly innovative 
in order for it to come up with relevant scenarios. In 2005 and 2006, many commentators 
realized that the U.S. housing market was experiencing a bubble and that sooner or later 
the bubble would burst. It is easy to be wise after the event, but one reasonable scenario 
for the stress-testing committee to propose during that period would have been a 20% 
or 30% decline in house prices in all parts of the country. 

It is important that senior management and the board of directors understand and 
recognize the importance of stress testing. They have the responsibility for taking strate- 
gic decisions based on the stress-testing results. One advantage of involving senior man- 
agement in the development of the scenarios to be used in stress testing is that it should 
naturally lead to a “buy in” by them to the idea that stress testing is important. The results 
generated from scenarios that are created by individuals who have middle management 
positions are unlikely to be taken as seriously. 


22.1.4 Core vs. Peripheral Variables 


When individual variables are stressed or scenarios are generated by management, the 
scenarios are likely to be incomplete in that movements of only a few (core) market 
variables are specified. One approach is to set changes in all other (peripheral) variables 
to zero, but this is likely to be unsatisfactory. Another approach is to regress the periph- 
eral variables on the core variables that are being stressed to obtain forecasts for them 
conditional on the changes being made to the core variables. These forecasts (as point 
forecasts or probability distributions) can be incorporated into the stress test. 

This is known as conditional stress testing and is discussed by Kupiec (1999)? Kim 
and Finger (2000) carry this idea further by using what they call a “broken arrow” 
stress test. In this, the correlation between the core variables and the peripheral variables 


"See R. Clemens and R. Winkler, “Combining Probability Distributions from Experts in Risk 
Analysis,” Risk Analysis 19, no. 2 (April 1999): 187-203. 
?P Kupiec, “Stress Testing in a Value at Risk Framework,” Journal of Derivatives 6 (1999): 7-24. 
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is based on what happens in stressed market conditions rather than what happens on 
average. 


22.1.5 Making Scenarios Complete 


Scenarios should be carefully examined in an attempt to make sure that all the adverse 
consequences are considered. The scenarios should include not only the immediate effect 
on the financial institution’s portfolio of a shock to market variables, but also any “knock- 
on” effect resulting from many different financial institutions being affected by the shock 
in the same way and responding in the same way. Many people have said that they 
recognized the real estate bubble in the United States would burst in 2007, but did not 
realize how bad the consequences would be. They did not anticipate that many financial 
institutions would experience losses at the same time with the result that there would be 
a flight to quality with severe liquidity problems and a huge increase in credit spreads. 

Another example of knock-on effects is provided by the failure of Long-Term 
Capital Management (LTCM) in 1998 (see Business Snapshot 22.1). LTCM tended to 
have long positions in illiquid securities and short positions in liquid securities. Its failure 
was caused by a flight to quality following Russia’s default on its debt. Many investors 
were only interested in buying liquid securities. Spreads between liquid and illiquid secu- 
rities increased. LTCM had done stress tests looking at the impact of flights to quality 
similar to those that had occurred pre-1998. What it did not allow for was the knock-on 
effect. Many hedge funds were following similar trading strategies to those of LTCM in 
1998. When the flight to quality started, they were all forced to unwind their positions at 
the same time. Unwinding meant selling illiquid securities and buying liquid securities, 
reinforcing the flight to quality and making it more severe than previous flights to quality. 

Scenarios should ideally be dynamic so that the response to the scenario of the finan- 
cial institution doing the stress test, as well as the responses of other financial institutions, 
are considered. Consider, for example, the situation where a financial institution has sold 
call options dependent on an underlying asset and maintains delta neutrality. A shock 
where there is a large increase in the asset price will lead to an immediate loss on the 
option position. To maintain delta neutrality (see Section 8.1), large amounts of the asset 
will have to be bought at a relatively high price. The cost of subsequent delta hedging 
is liable to depend on the volatility of the asset price. Often the volatility will be high, 
with big increases and decreases in the asset price before it settles down. The subsequent 
delta hedging is then particularly expensive. 


22.1.6 Reverse Stress Testing 


Reverse stress testing involves the use of computational procedures to search for scenarios 
that lead to a failure of the financial institution. It has become an important tool in risk 
management, and many bank supervisory bodies now require banks to have reverse stress 
testing programs in place. 


3See J. Kim and C. C. Finger, “A Stress Test to Incorporate Correlation Breakdown,” Journal of Risk 
2, no. 3 (Spring 2000): 5—19. 
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BUSINESS SNAPSHOT 22.1 
Long-Term Capital Management’s Big Loss 


Long-Term Capital Management (LTCM), a hedge fund formed in the mid- 
1990s, always collateralized its transactions. The hedge fund’s investment strategy 
was known as convergence arbitrage. A very simple example of what it might do 
is the following. It would find two bonds, X and Y, issued by the same company 
promising the same payoffs, with X being less liquid (i.e., less actively traded) 
than Y. The market always places a value on liquidity. As a result, the price of 
X would be less than the price of Y. LTCM would buy X, short Y, and wait, 
expecting the prices of the two bonds to converge at some future time. 

When interest rates increased, the company expected both bonds to move 
down in price by about the same amount so that the collateral it paid on bond X 
would be about the same as the collateral it received on bond Y. Similarly, when 
interest rates decreased, LTCM expected both bonds to move up in price by 
about the same amount so that the collateral it received on bond X would be 
about the same as the collateral it paid on bond Y. It therefore expected that 
there would be no significant outflow of funds as a result of its collateralization 
agreements. 

In August 1998, Russia defaulted on its debt and this led to what is termed a 
“flight to quality” in capital markets. One result was that investors valued liquid 
instruments more highly than usual and the spreads between the prices of the 
liquid and illiquid instruments in LT'CM’s portfolio increased dramatically. The 
prices of the bonds LTCM had bought went down and the prices of those it had 
shorted increased. It was required to post collateral on both. The company was 
highly leveraged and found it difficult to make the payments required under the 
collateralization agreements. The result was that positions had to be closed out 
and there was a total loss of about $4 billion. If the company had been less highly 
leveraged it would probably have been able to survive the flight to quality and 
could have waited for the prices of the liquid and illiquid bonds to become closer. 


Example 22.1 


As a simple example of reverse stress testing, suppose a financial institution has positions 
in four European call options on an asset. The asset price is $50, the risk-free rate is 3%, 
the volatility is 20%, and there is no income on the asset. The positions, strike prices, and 
lives of the options are as indicated in the table below. The current value of the position 
(in millions of dollars) is —25.90. A numerical procedure can be used to search for one- 
day changes in the asset price and the volatility that will lead to the greatest losses. Some 
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bounds should be put on the changes that are considered. We assume that the asset price 
will not decrease below $40 or increase above $60. It is assumed that the volatility will 
not fall below 10% or rise above 30%. 


Position (000s) Strike Price Life (years) Position Value ($ millions) 
+250 50 1.0 1,176.67 
—125 60 1.5 —293.56 
—75 40 0.8 —843.72 
—50 55 0.5 —65.30 
Total —25.90 


Using the formulas for valuing European options (see Appendix E) in conjunction 
with Excel’s Solver, the worst loss is found to be when the volatility decreases to 10% and 
the asset price falls to $45.99. The loss is $289.38 million. Reverse stress testing therefore 
shows that the financial institution is most exposed to a reduction of about 8% in the 
asset price combined with a sharp decline in volatility. 

This might seem to be an unreasonable scenario. It is unlikely that volatility would 
go down sharply when the asset price declines by 8%. Solver could be run again with 
the lower bound to volatility being 20% instead of 10%. This gives a worst-case loss 
occurring when the volatility stays at 20% and the asset price falls to $42.86. The loss is 
then $87.19 million. 


Searching over all the market variables to which a financial institution is exposed in 
the way indicated in Example 22.1 is in practice usually not computationally feasible. 
One approach is to identify 5 to 10 key market variables and assume that changes in 
other variables are dependent on changes in these variables. 

Another way of simplifying the search process is to impose some structure on the 
problem. A principal components analysis (see Section 9.7) can be carried out on the 
changes in market variables (ideally using data from stressed market conditions) and then 
a search can be conducted to determine the changes in the principal components that 
generate large losses. This reduces the dimension of the space over which the search is 
conducted and should lead to fewer implausible scenarios. 

An alternative approach is for the risk management group to impose a structure on 
scenarios. For example, a number of adverse scenarios that have occurred in the past can 
be specified. An analyst can then search to find what multiplier must be applied to the 
changes observed in all market variables under the scenarios in order for a particular 
loss level to be reached. For example, it might be concluded that a financial institution 
could survive a repeat of market movements during the credit crisis of 2008, but that if 
movements were 50% greater there would be serious problems. 

Reverse stress testing can be used as a tool to facilitate brainstorming by the stress- 
testing committee. Prior to a meeting of the stress-testing committee, analysts can use 
reverse stress testing to come up with several scenarios that would be disastrous to the 
financial institution. These scenarios, along with others they generate themselves, are 
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then considered by the stress-testing committee. They use their judgment to eliminate 
some of the analysts’ scenarios as implausible and modify others so that they become 
plausible and are retained for serious evaluation. 


22.2 Regulation 


The Basel Committee requires market risk calculations that are based on a bank’s internal 
VaR models to be accompanied by “rigorous and comprehensive” stress testing. Simi- 
larly, banks using the IRB approach in Basel II (advanced or foundation) to determine 
credit risk capital are required to conduct stress tests to determine the robustness of their 
assumptions. 

In May 2009, the Basel Committee issued the final version of its recommendations 
on stress-testing practices and how stress testing should be supervised by regulators.4 The 
recommendations emphasize the importance of stress testing in determining how much 
capital is necessary to absorb losses should large shocks occur. They make the point that 
stress testing is particularly important after long periods of benign conditions because 
such conditions tend to lead to complacency. 

The recommendations highlight the importance of top management and board 
involvement in stress testing. In particular, top management and board members should 
be involved in setting stress-testing objectives, defining scenarios, discussing the results of 
stress tests, assessing potential actions, and decision making. It makes the point that the 
banks that fared well in the financial crisis that started in mid-2007 were the ones where 
senior management as a whole took an active interest in the development and operation 
of stress testing, with the results of stress testing serving as an input into strategic decision 
making. Stress testing should be conducted across all areas of the bank. It should not be 
the case that each area conducts its own stress test. 

The Basel recommendations make the point that many of the scenarios chosen pre- 
2007 were based on historical data and much less severe than what actually happened. 
Specific recommendations for banks are: 


1. Stress testing should form an integral part of the overall governance and risk manage- 
ment culture of the bank. Stress testing should be actionable, with the results from 
stress-testing analyses impacting decision making at the appropriate management 
level, including strategic business decisions of the board and senior management. 
Board and senior management involvement in the stress-testing program is essential 
for its effective operation. 

2. A bank should operate a stress-testing program that promotes risk identification and 
control, provides a complementary risk perspective to other risk management tools, 
improves capital and liquidity management, and enhances internal and external com- 
munication. 


+See “Principles for Sound Stress-Testing Practices and Supervision,” Basel Committee on Banking 
Supervision, May 2009. 
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10. 


11. 


12. 


13. 


14. 


15. 


. Stress-testing programs should take account of views from across the organization 


and should cover a range of perspectives and techniques. 


. A bank should have written policies and procedures governing the stress-testing pro- 


gram. The operation of the program should be appropriately documented. 


. A bank should have a suitably robust infrastructure in place, which is sufficiently 


flexible to accommodate different and possibly changing stress tests at an appropriate 
level of granularity. 


. A bank should regularly maintain and update its stress-testing framework. The effec- 


tiveness of the stress-testing program, as well as the robustness of major individual 
components, should be assessed regularly and independently. 


. Stress tests should cover a range of risks and business areas, including at the firmwide 


level. A bank should be able to integrate effectively across the range of its stress-testing 
activities to deliver a complete picture of firmwide risk. 


. Stress-testing programs should cover a range of scenarios, including forward-looking 


scenarios, and aim to take into account system-wide interactions and feedback effects. 


. Stress tests should feature a range of severities, including events capable of generating 


most damage whether through size of loss or through loss of reputation. A stress- 
testing program should also determine what scenarios could challenge the viability 
of the bank (reverse stress tests) and thereby uncover hidden risks and interactions 
among risks. 

As part of an overall stress-testing program, a bank should aim to take account of 
simultaneous pressures in funding and asset markets, and the impact of a reduction 
in market liquidity on exposure valuation. 

The effectiveness of risk mitigation techniques should be systematically challenged. 
The stress-testing program should explicitly cover complex and bespoke products 
such as securitized exposures. Stress tests for securitized assets should consider the 
underlying assets, their exposure to systematic market factors, relevant contractual 
arrangements and embedded triggers, and the impact of leverage, particularly as it 
relates to the subordination level in the issue structure. 

The stress-testing program should cover pipeline and warehousing risks. A bank 
should include such exposures in its stress tests regardless of their probability of being 
securitized. 

A bank should enhance its stress-testing methodologies to capture the effect of repu- 
tational risk. The bank should integrate risks arising from oft-balance-sheet vehicles 
and other related entities in its stress-testing program. 

A bank should enhance its stress-testing approaches for highly leveraged counterpar- 
ties in considering its vulnerability to specific asset categories or market movements 
and in assessing potential wrong-way risk related to risk-mitigating techniques. 


5 “Pipeline and warehousing” risks refer to risks associated with assets that are awaiting securitization, 


but might not be securitized if market conditions change. These risks led to losses during the onset 


of the crisis. 
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The recommendations for bank supervisors are: 


16. Supervisors should make regular and comprehensive assessments of a bank’s stress- 
testing programs. 

17. Supervisors should require management to take corrective action if material defi- 
ciencies in the stress-testing program are identified or if the results of stress tests are 
not adequately taken into consideration in the decision-making process. 

18. Supervisors should assess and, if necessary, challenge the scope and severity of firm- 
wide scenarios. Supervisors may ask banks to perform sensitivity analysis with respect 
to specific portfolios or parameters, use specific scenarios, or evaluate scenarios under 
which their viability is threatened (reverse stress-testing scenarios). 

19. Under the Pillar 2 (supervisory review process) of the Basel II framework, supervisors 
should examine a bank’s stress-testing results as part of a supervisory review of both 
the bank’s internal capital assessment and its liquidity risk management. In particular, 
supervisors should consider the results of forward-looking stress testing for assessing 
the adequacy of capital and liquidity. 

20. Supervisors should consider implementing stress-test exercises based on common 
scenarios. 

21. Supervisors should engage in a constructive dialogue with other public authorities 
and the industry to identify systemic vulnerabilities. Supervisors should also ensure 
that they have the capacity and the skills to assess banks’ stress-testing programs. 


22.2.1 Scenarios Chosen by Regulators 


Bank regulators require banks to consider extreme scenarios and then make sure they 
have enough capital for those scenarios. There is an obvious problem here. Banks want 
to keep their regulatory capital as low as possible. They therefore have no incentive to 
consider extreme scenarios that would lead to a bank supervisor telling them that their 
capital requirements need to be increased. There is therefore a natural tendency for the 
scenarios they consider to be “watered down” and fairly benign. 

One approach to overcoming this problem is for regulators themselves to provide the 
scenarios (see Recommendations 18 and 20). Regulators in many jurisdictions (including 
the United Kingdom, the European Union, and the United States) now regularly carry 
out their own stress tests to determine whether the banks they are supervising have 
enough capital to survive extreme adverse events. If banks fail the stress test, they are 
required to raise more capital and may be subject to dividend restrictions. 

In the United States, the Federal Reserve has since 2009 carried out an annual stress 
test for all banks with consolidated assets of more than $50 billion (i.e., G-SIBs and 
D-SIBs).Since 2011 this has been called the Comprehensive Capital Analysis and Review 
(CCAR). Banks are required to submit a capital plan (including planned dividends). 
The scenarios created by the Federal Reserve are recession scenarios similar to those 
in 1973-1975, 1981-1982, and 2007—2009. The scenarios include projections for about 
25 variables, including macroeconomic variables such as gross domestic product (GDP) 
growth, unemployment rate, stock market indices, and house price indices. 
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Banks in the United States with consolidated assets over $10 billion are subject to the 
Dodd—Frank Act Stress Test (DFAST). The scenarios considered in DFAST are similar to 
those in CCAR. However, banks do not have to submit a capital plan, as the management 
of capital is assumed to be based on a standard set of assumptions. Both CCAR and 
DFAST involve a huge amount of work for banks with thousands of pages of justification 
for the models used being produced. 

By choosing scenarios themselves, regulators are able to focus the attention of banks 
on recession scenarios that are of concern to regulators. If regulators see many banks 
taking positions with similar risks, they can require all banks to consider a particular set 
of scenarios that give rise to adverse results for the positions. The downside of regu- 
lators generating scenarios themselves is that part of the reason for the increased focus 
by supervisors on stress testing is that they want to encourage financial institutions to 
spend more time generating and worrying about potential adverse scenarios. If supervi- 
sors do the work in generating the scenarios, this may not happen and banks will focus 
only on the scenarios generated by regulators. In an ideal regulatory environment, both 
management-generated scenarios and supervisor-generated scenarios are evaluated. 

There is a danger that financial institutions will find actions they can take that allow 
them to pass the stress tests without making them any safer. An extreme example of 
this is in Business Snapshot 22.2. When regulators defined key scenarios for life insur- 
ance companies and pension funds in Denmark, some of those companies responded by 
hedging against the particular scenarios used by regulators—and only against those sce- 
narios.° This is not what regulators intended. Each scenario used in stress testing should 
be viewed as representative of a range of things that might happen. Financial institutions 
should ensure that their capital will be in good shape not just for the specified scenarios, 
but also for other similar or related scenarios. Regulators are now well aware of this type 
of problem and have procedures aimed at ensuring that banks do not game the system 
in this way. 


22.3 What to Do with the Results 


The biggest problem in stress testing is using the results effectively. All too often, the 
results of stress testing are ignored by senior management. A typical response is, “Yes, 
there are always one or two scenarios that will sink us. We cannot protect ourselves 
against everything that might happen.’ One way of trying to avoid this sort of response 
is to involve senior management in the development of scenarios, as outlined earlier. A 
better response on the part of senior management would be, “Are the risks associated 
with these scenarios acceptable? If not, let’s investigate what trades we can put on or 
other actions we can take to make these types of risks more acceptable.” 


The information in Business Snapshot 22.2 is from P. L. Jorgensen, “Traffic Light Options,” Journal 
of Banking and Finance 31, no. 12 (December 2007): 3698-3719. 
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BUSINESS SNAPSHOT 22.2 
Traffic Light Options 


In June 2001, the Danish Financial Supervisory Authority (DFSA) introduced 
a “traffic light” solvency stress-testing system. This requires life insurance com- 
panies and pension funds to submit semiannual reports indicating the impact on 
them of certain predefined shocks. The “red light scenario” involves a 70-basis- 
point decrease in interest rates,a 12% decline in stock prices, and an 8% decline 
in real estate prices. If capital falls below a specified critical level in this sce- 
nario, the company is categorized with “red light status” and is subject to more 
frequent monitoring with monthly reports being required. The “yellow light 
scenario” involves a 100-basis-point decrease in interest rates, a 30% decline in 
stock prices, and a 12% decline in real estate prices. If capital falls below the crit- 
ical level in this scenario, the company is categorized with “yellow light status” 
and has to submit quarterly reports. When the company’s capital stays above the 
critical levels for the red and yellow light scenarios, the company has a “green 
light status” and is subject to normal semiannual reporting. 

Derivatives dealers have developed products for helping life insurance and 
pension funds keep a green light status. These are known as traffic light options. 
They pay off in the traffic light scenarios so as to give a boost to the finan- 
cial institution’s performance when these scenarios are considered. Traffic light 
options pay off only when the movements close to those specified by regulators 
in the red light or yellow light scenario occur. The options are therefore not too 
expensive. 


they have two separate reports on their desks concerning what could go wrong. One 
report comes from VaR or ES models, the other from internal stress testing. Which one 


The problem for both senior management and the risk management group is that 


should they base their decision making on? 


22.3.1 Integrating Stress Testing and VaR Calculations 


Berkowitz (2000) suggests that stress testing will be taken more seriously if its results are 
integrated into the calculation of VaR.’ This can be done by assigning a probability to 
each stress scenarios that is considered. Suppose that a financial institution has considered 
n, stress scenarios and the total probability assigned to the stress scenarios is p. Assume 


7See J. Berkowitz, “A Coherent Framework for Stress Testing,” Journal of Risk 2, no. 2 (Winter 


1999/2000): 5-15. 
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further that there are n, VaR scenarios generated using historical simulation in the usual 
way. An analyst can assume that there are a total of n, + n, scenarios. The n, stress scenarios 
have probability p and the n, historical scenarios have probability 1 — p. 

Unfortunately human beings are not good at estimating a subjective probability for 
the occurrence of a rare event. To make the task feasible for the stress-testing committee, 
one approach is to ask the stress-testing committee to allocate each scenario to categories 
with preassigned probabilities. The categories might be: 


1. Probability = 0.05%. Extremely unlikely. One chance in 2,000. 

2. Probability = 0.2%. Very unlikely, but the scenario should be given about the same 
weight as the 500 scenarios used in the historical simulation analysis. 

3. Probability = 0.5%. Unlikely, but the scenario should be given more weight than the 
500 scenarios used in the historical simulation analysis. 


Example 22.2 


Suppose that, in the example in Section 13.1, five stress scenarios are considered. They 
lead to losses ($000s) of 235, 300, 450, 750, and 850. The probabilities assigned to the 
scenarios are 0.5%, 0.2%, 0.2%, 0.05%, and 0.05%, respectively. The total probability 
of the stress scenarios is, therefore, 1%. This means that the probability assigned to the 
scenarios generated by historical simulation is 99%. Assuming that equal weighting is 
used, each historical simulation scenario is assigned a probability of 0.99/500 = 0.00198. 
Table 13.4 is therefore replaced by Table 22.1. The probabilities assigned to scenarios are 


Table 22.1 Losses Ranked from Highest to Lowest 


Scenario Loss ($000s) Probability Cumulative Probability 
s5 850.000 0.00050 0.00050 
s4 750.000 0.00050 0.00100 
v494 477.841 0.00198 0.00298 
s3 450.000 0.00200 0.00498 
v339 345.435 0.00198 0.00696 
s2 300.000 0.00200 0.00896 
v349 282.204 0.00198 0.01094 
v329 277.041 0.00198 0.01292 
v487 253.385 0.00198 0.01490 
s1 235.000 0.00500 0.01990 
v227 217.974 0.00198 0.02188 
v131 205.256 0.00198 0.02386 
v238 201.389 0.00198 0.02584 


For Example 22.2, s1,s2,... are the stress scenarios; v1, v2,... are the VaR historical simulation scenarios. 
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accumulated from the worst scenario to the best8 The VaR level when the confidence 
level is 99% is the first loss for which the cumulative probability is greater than 0.01. In 
our example this is $282,204. 


Rebonato (2010) suggests a more elaborate approach to assessing probabilities of sce- 
narios involving a careful application of a well-known result in statistics, Bayes’ theorem, 
and what are known as Bayesian networks.’ The probability of a scenario consisting of 
two events is equal to the probability of the first event happening times the probability 
of the second event happening conditional on the first event having happened. Similarly 
the probability of a scenario consisting of three events is the probability of the first event 
happening times the probability of the second event happening conditional that the first 
event has happened times the probability of the third event happening conditional that 
the first two events have happened. Rebonato’s approach provides a way of evaluating 
the conditional probabilities. 


22.3.2 Subjective vs. Objective Probabilities 


There are two types of probability estimates: objective and subjective. An objective prob- 
ability for an event is a probability calculated by observing the frequency with which 
the event happens in repeated trials. As an idealized example of an objective probability, 
consider an urn containing red balls and black balls in an unknown proportion. We want 
to know the probability of a ball drawn at random from the urn being red. We could 
draw a ball at random, observe its color, replace it in the urn, draw another ball at random, 
observe its color, replace it in the urn, and so on, until 100 balls have been drawn. If 30 of 
the balls that have been drawn are red and 70 are black, our estimate for the probability 
of drawing a red ball is 0.3. Unfortunately, most objective probabilities calculated in real 
life are usually less reliable than the probability in this example, because the probability 
of the event happening does not remain constant for the observations that are available 
and the observations may not be independent. 

A subjective probability is a probability derived from an individual’s personal judgment 
about the chance of a particular event occurring. The probability is not based on histor- 
ical data. It is a degree of belief. Different people are liable to have different subjective 
probabilities for the same event. 

The probabilities in historical simulation are objective whereas the probabilities 
assigned to the scenarios in stress testing are subjective. Many analysts are uncomfortable 
with subjective probabilities because they are not based on data. Also, it is unfortunately 
the case that political considerations may play a part in a financial institution’s decision 


8 This is the same procedure that we used when weights were assigned to historical simulation sce- 
narios. (See Table 13.5 in Section 13.3.) 

?See Riccardo Rebonato, Coherent Stress Testing: A Bayesian Approach to Financial Stress (Chichester, 
UK: John Wiley & Sons, 2010). 


Scenario Analysis and Stress Testing 511 


to focus on historical data. If you use historical data and things go wrong, you can blame 
the data. If you use subjective judgments that have been provided by a group of people, 
those people are liable to be blamed. 

However, if it is based only on objective probabilities, risk management is inevitably 
backward looking and fails to capitalize on the judgment and expertise of senior man- 
agers. It is the responsibility of those managers to steer the financial institution so that 
catastrophic risks are avoided. 


Summary 


Stress testing is an important part of the risk management process. It leads to a financial 
institution considering the impact of extreme scenarios that are ignored by a traditional 
VaR or ES analysis, but that do happen from time to time. Once plausible scenarios 
have been evaluated, the financial institution can take steps to lessen the impact of the 
particularly bad ones. One advantage of a comprehensive stress-testing program is that 
a financial institution obtains a better understanding of the nature of the risks in its 
portfolio. 

Scenarios can be generated in a number of different ways. One approach is to consider 
extreme movements in just one market variable while keeping others fixed. Another is 
to use the movements in all market variables that occurred during periods in the past 
when the market experienced extreme shocks. The best approach is to ask a committee 
of senior management and economists to use their judgment and experience to generate 
the plausible extreme scenarios. Sometimes financial institutions carry out reverse stress 
testing where algorithms are used to search for scenarios that would lead to large losses. 
Scenarios should be as complete as possible and include the impact of knock-on effects 
as well as the initial shock to market variables. The market turmoil starting in summer 
2007 shows that, in some cases, the knock-on effect can be significant and include a flight 
to quality, an increase in credit spreads, and a shortage of liquidity. 

Regulators require financial institutions to keep sufficient capital for stress scenar- 
ios. Regulators in the United States, European Union, and United Kingdom regularly 
develop stress scenarios to be evaluated by all the large financial institutions they are 
responsible for. This may lead to some of the financial institutions having to raise more 
capital and has the potential to uncover systemic risk problems. 

If subjective probabilities are assigned to the extreme scenarios that are considered, 
stress testing can be integrated with a VaR analysis. This is an interesting idea, but was not 
one of the approaches outlined in the Basel Committee consultative document published 
in May 2009. 
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Practice Questions and Problems 
(Answers at End of Book) 


22.1 Explain three different ways that scenarios can be generated for stress testing. 

22.2 What is reverse stress testing? How is it used? 

22.3 Why might the regulatory environment lead to a financial institution underesti- 
mating the severity of the scenarios it considers? 

22.4 What are traffic light options? What are their drawbacks? 

22.5 Why is it important for senior management to be involved in stress testing? 

22.6 What are the advantages and disadvantages of bank regulators choosing some of 
the scenarios that are considered for stress testing? 

22.7 Explain the difference between subjective and objective probabilities. 

22.8 Suppose that, in the example in Section 13.1,seven stress scenarios are considered. 
They lead to losses ($000s) of 240, 280, 340, 500, 700, 850, and 1,050. The sub- 
jective probabilities assigned to the scenarios are 0.5%, 0.5%, 0.2%, 0.2%, 0.05%, 
0.05%, and 0.05%, respectively. What is the new one-day 99% VaR that would be 
calculated using the procedure discussed in Section 22.3? 

22.9 Suppose that the positions in the four options in Example 22.1 are changed to 
200, —70, —120, and —60. Use Solver to calculate the worst-case scenario for a 
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22.10 


22.11 


daily change. Consider asset prices between $40 and $60 and volatilities between 
10% and 30%. 


Further Questions 


What difference does it make to the worst-case scenario in Example 22.1 if (a) the 
options are American rather than European and (b) the options are barrier options 
that are knocked out if the asset price reaches $65? Use Solver to search over asset 
prices between $40 and $60 and volatilities between 18% and 30%. 

What difference does it make to the VaR calculated in Example 22.2 if the 
exponentially weighted moving average model is used to assign weights to 
scenarios as described in Section 13.3? 


Chapter 23 


Operational Risk 


n 1999, the Basel Committee announced plans to assign capital for operational risk 
I in the new Basel II regulations. This met with some opposition from banks. The 

chairman and CEO of one major international bank described it as “the dopiest 
thing I have ever seen.” However, bank supervisors did not back down, because they 
knew that many of the big losses of banks over the previous 10 years had been operational 
risk losses—not credit risk or market risk losses. 

Operational risk has also led to some big losses since the 1999 announcement. A 
big rogue trader loss at Société Générale in 2008 is discussed in Business Snapshot 
5.5, and there was a similar loss at UBS in 2011. The London Whale loss at JPMor- 
gan Chase in 2012 is discussed in Business Snapshot 25.1. In 2014, it was announced 
that the French bank BNP Paribas would pay $9 billion (roughly one year’s profit) to 
the U.S. government for violating economic sanctions by moving dollar-denominated 
transactions through the American banking system on behalf of Sudanese, Iranian, and 
Cuban parties. The bank was also banned from conducting certain U.S. transactions for 
a year. 

An increasingly important type of operational risk for banks is cyber risk. Banks 
have sophisticated systems in place to protect themselves from cyber attacks, but the 
attacks themselves are becoming more sophisticated. Also, banks are making increas- 
ing use of computer systems and the Internet, providing more avenues for cyber fraud. 
Customers and employees have to be continually educated so that the bank’s data 
remain secure. Cyber attacks on banks are attractive to criminals because, to quote the 
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bank robber Willie Sutton, “that’s where the money is.” They are also attractive to 
terrorists because of their potential to damage a nation’s economic security and way 
of life. 

Some regulators now regard operational risk as the most important risk facing banks. 
To quote Thomas J. Curry, head of the Office of the Comptroller of the Currency (OCC) 
in the United States, in 2012: “Given the complexity of today’s banking markets and the 
sophistication of the technology that underpins it, it is no surprise that the OCC deems 
operational risk to be high and increasing. Indeed, it is currently at the top of the list of 
safety and soundness issues for the institutions we supervise.” He goes on to argue that 
operational risk is more important than credit risk! Most banks have always had some 
framework in place for managing operational risk. However, the prospect of new capital 
requirements led them to greatly increase the resources they devote to measuring and 
monitoring operational risk. 

It is much more difficult to quantify operational risk than credit or market risk. 
Operational risk is also more difficult to manage. Financial institutions make a conscious 
decision to take a certain amount of credit and market risk, and there are many traded 
instruments that can be used to reduce these risks. Operational risk, by contrast, is a 
necessary part of doing business. An important part of operational risk management is 
identifying the types of risk that are being taken and should be insured against. There is 
always a danger that a huge loss will be incurred from taking an operational risk that ex 
ante was not even recognized as a risk. 

It might be thought that a loss such as that at Société Générale (SocGen, see Business 
Snapshot 5.5) was a result of market risk because it was movements in market variables 
that led to it. However, it should be classified as an operational risk loss because it involved 
fraud. (Jérôme Kerviel created fictitious trades to hide the big bets he was making.) 
Suppose there was no fraud. If it was part of the bank’s policy to let traders take huge 
risks, then the loss would be classified as a market risk loss. But, if this was not part 
of the bank’s policy and there was a breakdown in its controls, it would be classified 
as an operational risk loss. The SocGen example illustrates that operational risk losses 
are often contingent on market movements. If the market had moved in Kerviel’s favor, 
there would have been no loss. The fraud and the breakdown in SocGen’s control systems 
might then never have come to light. 

There are some parallels between the operational risk losses of banks and the losses 
of insurance companies. Insurance companies face a small probability of a large loss from 
a hurricane, earthquake, or other natural disaster. Similarly, banks face a small probability 
of a large operational risk loss. But there is one important difference. When insurance 
companies lose a large amount of money because of a natural disaster, all companies in 
the industry tend to be affected and often premiums rise the next year to cover losses. 
Operational risk losses tend to affect only one bank. Because it operates in a competitive 


| Speech to the Exchequer Club, May 16, 2012. 
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environment, the bank does not have the luxury of increasing prices for the services it 
offers during the following year. 


23.1 Defining Operational Risk 


There are many different ways in which operational risk can be defined. It is tempting 
to consider operational risk as a residual risk and define it as any risk faced by a financial 
institution that is not market risk or credit risk. To produce an estimate of operational 
risk, we could then look at the financial institution’s financial statements and remove 
from the income statement (a) the impact of credit losses and (b) the profits or losses from 
market risk exposure. The variation in the resulting income would then be attributed to 
operational risk. 

Most people agree that this definition of operational risk is too broad. It includes the 
risks associated with entering new markets, developing new products, economic factors, 
and so on. Another possible definition is that operational risk, as its name implies, is the 
risk arising from operations. This includes the risk of mistakes in processing transactions, 
making payments, and so on. This definition of risk is too narrow. It does not include 
major risks such as the risk of a rogue trader such as Jérôme Kerviel. 

We can distinguish between internal risks and external risks. Internal risks are those 
over which the company has control. The company chooses whom it employs, what 
computer systems it develops, what controls are in place, and so on. Some people define 
operational risks as all internal risks. Operational risk then includes more than just the 
risk arising from operations. It includes risks arising from inadequate controls such as the 
rogue trader risk and the risks of other sorts of employee fraud. 

Bank regulators favor including more than just internal risks in their definition of 
operational risk. They include the impact of external events such as natural disasters 
(for example, a fire or an earthquake that affects the bank’s operations), political and 
regulatory risk (for example, being prevented from operating in a foreign country by 
that country’s government), security breaches, and so on. All of this is reflected in the 
following definition of operational risk produced by the Basel Committee on Banking 
Supervision in 2001: 


The risk of loss resulting from inadequate or failed internal processes, people, and systems 
or from external events. 


Note that this definition includes legal risk but does not include reputation risk and the 
risk resulting from strategic decisions. 

Operational risks result in increases in the bank’s costs or decreases in its revenue. 
Some operational risks interact with credit and market risk. For example, when mistakes 
are made in a loan’s documentation, it is usually the case that losses result if and only if 
the counterparty defaults. When a trader exceeds limits and misreports positions, losses 
result if and only if the market moves against the trader. 
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23.2 Categorization of Operational Risks 


The Basel Committee on Banking Supervision has identified seven categories of oper- 


ational risk? These are: 


1. 


Internal fraud: Acts of a type intended to defraud, misappropriate property, or circum- 
vent regulations, the law, or company policy (excluding those concerned with diver- 
sity or discrimination) involving at least one internal party. Examples include inten- 
tional misreporting of positions, employee theft, insider trading on an employee’s 
own account, and rogue trading. 


. External fraud: Acts by a third party of a type intended to defraud, misappropriate 


property, or circumvent the law. Examples include robbery, forgery, check kiting, and 
damage from computer hacking. 


. Employment practices and workplace safety: Acts inconsistent with employment, health, or 


safety laws or agreements, or that result in payment of personal injury claims or claims 
relating to diversity or discrimination issues. Examples include worker compensation 
claims, violation of employee health and safety rules, organized labor activities, dis- 
crimination claims, and general liability (for example, a customer slipping and falling 
at a branch office). 


. Clients, products, and business practices: Unintentional or negligent failure to meet a 


professional obligation to clients and the use of inappropriate products or business 
practices. Examples are fiduciary breaches, misuse of confidential customer informa- 
tion, improper trading activities on the bank’s account, money laundering, and the 
sale of unauthorized products. 


. Damage to physical assets: Loss or damage to physical assets from natural disasters or 


other events. Examples include terrorism, vandalism, earthquakes, fires, and floods. 


. Business disruption and system failures: Disruption of business or system failures. Exam- 


ples include hardware and software failures, telecommunication problems, and utility 
outages. 


. Execution, delivery, and process management: Failed transaction processing or process 


management, and disputes with trade counterparties and vendors. Examples include 
data entry errors, collateral management failures, incomplete legal documentation, 
unapproved access given to clients’ accounts, non-client counterparty misperfor- 
mance, and vendor disputes. 


Eight different business lines within a bank can be distinguished: 


1. 
2: 
3. 
4. 


Corporate finance 
Trading and sales 
Retail banking 


Commercial banking 


? See Basel Committee on Banking Supervision, “Sound Practices for the Management and Supervi- 


sion of Operational Risk,” Bank for International Settlements, February 2003. 
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5. Payment and settlement 
6. Agency services 

7. Asset management 

8. Retail brokerage 


Combining the seven categories of risk with the eight business lines gives a total of 
7 X 8 = 56 potential sources of operational risk for a bank. 


23.3 Regulatory Capital Under Basel II 


We now discuss how regulatory capital is determined under Basel II. As we will see in 
Section 23.4, the Basel Committee has indicated its intention to totally change its pro- 
cedures for determining operational risk regulatory capital. In particular, it stated that 
the Advanced Measurement Approach (AMA), which is a complex procedure for deter- 
mining the one-year 99% VaR for operational risk losses, is to be abandoned. However, 
many banks have devoted huge resources to implementing AMA, and it is unlikely to 
disappear altogether. Indeed, regulators in some countries have indicated that their banks 
are welcome to continue using AMA. We will therefore review AMA in this section. 
Banks have three alternatives for determining operational risk regulatory capital 
under Basel II. The simplest approach is the basic indicator approach. Under this approach, 
operational risk capital is set equal to 15% of annual gross income over the previous 
three years. Gross income is defined as net interest income plus noninterest income? A 
slightly more complicated approach is the standardized approach. In this, a bank’s activities 
are divided into the eight business lines mentioned earlier: corporate finance, trading and 
sales, retail banking, commercial banking, payment and settlement, agency services, asset 
management, and retail brokerage. The average gross income over the last three years for 
each business line is multiplied by a “beta factor” for that business line and the result 
summed to determine the total capital. The beta factors are shown in Table 23.1. The 
third alternative is the advanced measurement approach (AMA). In this, the operational risk 


Table 23.1 Beta Factors in the Standardized Approach 


Business Line Beta Factor 
Corporate finance 18% 
Trading and sales 18% 
Retail banking 12% 
Commercial banking 15% 
Payment and settlement 18% 
Agency services 15% 
Asset management 12% 
Retail brokerage 12% 


$ Net interest income is the excess of income earned on loans over interest paid on deposits and other 
instruments that are used to fund the loans (see Section 2.2). 
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Figure 23.1 Calculation of Capital for Operational Risk 


regulatory capital requirement is calculated by the bank internally using qualitative and 
quantitative criteria. Similarly to credit capital, it is based on a one-year 99.9% VaR. 

The objective of banks using the AMA approach for operational risk is analogous to 
their objectives when they attempt to quantify credit risk. They would like to produce a 
probability distribution of losses such as that shown in Figure 23.1. Assuming that they 
can convince regulators that their expected operational risk cost is incorporated into 
their pricing of products, capital is assigned to cover unexpected costs. The unexpected 
loss is the difference between the one-year 99.9% VaR and the expected one-year loss. 

As mentioned, there are 7 X 8 = 56 combinations of risk types and business lines. 
Banks must estimate one-year 99.9% VaRs for each combination and then aggregate 
them, using the approach described in Section 12.9, to determine a single one-year 99.9% 
operational risk VaR measure. 


23.3.1 Loss Severity and Loss Frequency 


There are two distributions that are important in estimating potential operational risk 
losses for a risk type/business line combination. One is the loss frequency distribution and 
the other is the loss severity distribution. The loss frequency distribution is the distribution 
of the number of losses observed during one year. The loss severity distribution is the 
distribution of the size of a loss, given that a loss occurs. It is usually assumed that loss 
severity and loss frequency are independent. 

For loss frequency, the natural probability distribution to use is a Poisson distribution. 
This distribution assumes that losses happen randomly through time so that in any short 
period of time At there is a probability AAt of a loss occurring. The probability of n 
losses in T years is 


er (A T) n 
n! 
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The parameter À can be estimated as the average number of losses per year. For exam- 
ple, if during a 10-year period there were a total of 12 losses, à would be estimated as 
1.2 per year. A Poisson distribution has the property that the mean frequency of losses 
equals the variance of the frequency of losses.* 

For the loss-severity probability distribution, a lognormal probability distribution is 
often used. The parameters of this probability distribution are the mean and standard 
deviation of the logarithm of the loss. 

The loss-frequency distribution can be combined with the loss severity distribution 
for each risk type/business line combination to determine a loss distribution. Monte 
Carlo simulation can be used for this purpose” As mentioned earlier, the usual assumption 
is that loss severity is independent of loss frequency. On each simulation trial, we proceed 
as follows: 


1. We sample from the frequency distribution to determine the number of loss events 
(= n) in one year. 

2. We sample n times from the loss severity distribution to determine the loss experi- 
enced for each loss event (L4, Ly, ..., Lp). 


3. We determine the total loss experienced (= L} + L, + +++ + L,). 


When many simulation trials are used, we obtain a total loss distribution for losses of the 
type being considered. The 99.9 percentile of the distribution can then be determined. 

Figure 23.2 illustrates the procedure. In this example, the expected loss frequency is 
3 per year and the loss severity is drawn from a lognormal distribution. The logarithm 
of each loss ($ millions) is assumed to have a mean of zero and a standard deviation 
of 0.4. The Excel worksheet used to produce Figure 23.2 is on the author’s website: 
www-2.rotman.utoronto.ca/hull/riskman. 

The Basel Committee requires the implementation of AMA to involve four elements: 
internal data, external data, scenario analysis, and business environment and internal con- 
trol factors We will now consider each of these in turn. 


23.3.2 Internal Data 


Unfortunately, there is usually relatively little historical data available within a bank to 
estimate loss severity and loss frequency distributions for particular types of losses. Many 


‘Tf the mean frequency is greater than the variance of the frequency, a binomial distribution may be 
more appropriate. If the mean frequency is less than the variance, a negative binomial distribution 
(mixed Poisson distribution) may be more appropriate. 

> Combining the loss severity and loss frequency distribution is a very common problem in insurance. 
Apart from Monte Carlo simulation, two approaches that are used are Panjer’s algorithm and fast 
Fourier transforms. See H. H. Panjer, “Recursive Evaluation of a Family of Compound Distributions,” 
ASTIN Bulletin 12 (1981): 22-29. 

See Basel Committee on Banking Supervision, “Operational Risk: Supervisory Guidelines for the 
Advanced Measurement Approach,” June 2011. 
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Figure 23.2 Calculation of Loss Distribution from Loss Frequency and Loss Severity 


banks have not in the past kept records of operational risk losses. They are doing so 
now, but it may be some time before a reasonable amount of historical data is available. 
It is interesting to compare operational risk losses with credit risk losses in this respect. 
Traditionally, banks have done a much better job at documenting their credit risk losses 
than their operational risk losses. Also, in the case of credit risks, a bank can rely on a 
wealth of information published by credit rating agencies to assess probabilities of default 
and expected losses given default. Similar data on operational risk have not been collected 
in a systematic way. 

There are two types of operational risk losses: high-frequency low-severity losses 
(HFLSLs) and low-frequency high-severity losses (LFHSLs). An example of the first is 
credit card fraud losses. An example of the second is rogue trader losses. A bank should 
focus its attention on LFHSLs. These are what create the tail of the loss distribution. A 
particular percentile of the total loss distribution can be estimated as the corresponding 
percentile of the total LFHSL distribution plus the average of the total HFLSL. Another 
reason for focusing on LFHSLs is that HFLSLs are often taken into account in the pricing 
of products. 

By definition, LFHSLs occur infrequently. Even if good records have been kept, inter- 
nal data are liable to be inadequate, and must be supplemented with external data and 
scenario analysis. As we will describe, external data can be used for the loss severity dis- 
tribution. The loss frequency distribution must be specific to the bank and based on 
internal data and scenario analysis estimates. 
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23.3.3 External Data 


There are two sources of external data. The first is data consortia, which are compa- 
nies that facilitate the sharing of data between banks. (The insurance industry has had 
mechanisms for sharing loss data for many years and banks are now doing this as well.) 
The second is data vendors, who are in the business of collecting publicly available data 
in a systematic way. External data increases the amount of data available to a bank for 
estimating potential losses. It also has the advantage that it can lead to a consideration of 
types of losses that have never been incurred by the bank, but which have been incurred 
by other banks. 

Both internal and external historical data must be adjusted for inflation. In addition, a 
scale adjustment should be made to external data. Ifa bank with a revenue of $10 billion 
reports a loss of $8 million, how should the loss be scaled for a bank with a revenue 
of $5 billion? A natural assumption is that a similar loss for a bank with a revenue of 
$5 billion would be $4 million. But this estimate is probably too small. For example, 
research by Shih et al. (2000) suggests that the effect of firm size on the size of a loss 
experienced is non-linear.’ Their estimate is 


Estimated Loss for Bank A = Observed Loss for Bank B X (Sunk A Revene Eves ) 

Bank B Revenue 
where & = 0.23. This means that in our example the bank with a revenue of $5 bil- 
lion would experience a loss of 8 X 0.5% = $6.82 million. After the appropriate scale 
adjustment, data obtained through sharing arrangements with other banks can be merged 
with the bank’s own data to obtain a larger sample for determining the loss severity 
distribution. 

The loss data available from data vendors are data taken from public sources such as 
newspapers and trade journals. Data from vendors cannot be used in the same way as 
internal data or data obtained through sharing arrangements because they are subject to 
biases. For example, only large losses are publicly reported, and the larger the loss, the 
more likely it is to be reported. 

Data from vendors are most useful for determining relative loss severity. Suppose 
that a bank has good information on the mean and standard deviation of its loss severity 
distribution for internal fraud in corporate finance, but not for external fraud in corporate 
finance or for internal fraud in trading and sales. Suppose that the bank estimates the mean 
and standard deviation of its loss severity distribution for internal fraud in corporate 
finance as $50 million and $30 million. Suppose further that external data indicate that, 
for external fraud in corporate finance, the mean severity is twice that for internal fraud 
in corporate finance and the standard deviation of the severity is 1.5 times as great. In the 


7See J. Shih, A. Samad-Khan, and P. Medapa, “Is the Size of an Operational Loss Related to Firm 
Size?” Operational Risk Magazine 2, no. 1 (January 2000). Whether Shih et als results apply to legal 
risks is debatable. The size of a settlement in a large lawsuit against a bank can be governed by how 
much the bank can afford. 
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absence of a better alternative, the bank might assume that its own severity for external 
fraud in corporate finance has a mean of 2 X 50 = $100 million and a standard deviation 
of severity equal to 1.5 X 30 = $45 million. Similarly, if the external data indicate that 
the mean severity for internal fraud in trading and sales is 2.5 times that for internal fraud 
in corporate finance and the standard deviation is twice as great, the bank might assume 
that its own severity for internal fraud in trading and sales has a mean of 2.5 x 50 = 
$125 million and a standard deviation of 2 X 30 = $60 million. 


23.3.4 Scenario Analysis 


Scenario analysis has become a key tool for the assessment of operational risk under the 
AMA. The aim of scenario analysis is to generate scenarios covering the full range of 
possible LFHSLs. Some of these scenarios might come from the bank’s own experience, 
some might be based on the experience of other banks, some might come from the 
work of consultants, and some might be generated by the risk management group in 
conjunction with senior management or business unit managers. The Basel Committee 
estimates that at many banks the number of scenarios considered that give rise to a loss 
of greater than 10 million euros is approximately 20 times larger than the number of 
internal losses of this amount. 

An operational risk committee consisting of members of the risk management group 
and senior management should be asked to estimate loss severity and loss frequency 
parameters for the scenarios. As explained earlier, a lognormal distribution is often used 
for loss severity and a Poisson distribution is often used for loss frequency. Data from 
other banks may be useful for estimating the loss severity parameters. The loss frequency 
parameters should reflect the controls in place at the bank and the type of business it is 
doing. They should reflect the views of the members of the operational risk committee. 
Similarly to Section 22.3, a number of categories of loss frequency can be defined, 
such as: 


1. Scenario happens once every 1,000 years on average (A = 0.001) 
2. Scenario happens once every 100 years on average (A = 0.01) 

3. Scenario happens once every 50 years on average (A = 0.02) 

4. Scenario happens once every 10 years on average (A = 0.1) 

5. Scenario happens once every 5 years on average (A = 0.2) 


The committee can be asked to assign each scenario that is developed to one of the 
categories. 

One difference between this scenario analysis and the one in Chapter 22 is that 
there is no model for determining losses and, if data are not available, the parameters of 
the loss severity distribution have to be estimated by the committee. One approach is 
to ask the committee to estimate an average loss and a high loss that the committee is 
99% certain will not be exceeded. A lognormal distribution can then be fitted to the 
estimates. 
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Fortunately, the operational risk environment does not usually change as fast as the 
market and credit risk environment so that the amount of work involved in developing 
scenarios and keeping them up to date should not be as onerous. Nevertheless, the 
approach we have described does require a great deal of senior management time. The 
relevant scenarios for one bank are often similar to those for other banks and, to lessen 
the burden on the operational risk committee, there is the potential for standard scenar- 
ios to be developed by consultants or by bank industry associations. However, the loss 
frequency estimates should always be specific to a bank and reflect the controls in place 
in the bank and the type of business it is currently doing. 

As in the case of market and credit risk stress testing, the advantage of generating 
scenarios using managerial judgment is that they include losses that the financial insti- 
tution has never experienced, but could incur. The scenario analysis approach leads to 
management thinking actively and creatively about potential adverse events. This can 
have a number of benefits. In some cases, strategies for responding to an event so as to 
minimize its severity are likely to be developed. In other cases, proposals may be made 
for reducing the probability of the event occurring at all. 

Whether scenario analysis or internal/external data approaches are used, distribu- 
tions for particular loss types have to be combined to produce a total operational risk 
loss distribution. The correlations assumed for the losses from different operational risk 
categories can make a big difference to the one-year 99.9% VaR that is calculated, and 
therefore to the AMA capital. Chapter 26 builds on material in Section 12.9 to explain 
how correlations can be used to aggregate economic capital requirements across market 
risk, credit risk, and operational risk. The same approach can be used to aggregate differ- 
ent operational risk capital requirements. It is often argued that operational risk losses are 
largely uncorrelated with each other and there is some empirical support for this view. 
If the zero-correlation assumption is made, Monte Carlo simulation can be used in a 
straightforward way to sample from the distribution of losses for each scenario to obtain 
a total distribution of risk losses. 


23.3.5 Business Environment and Internal Control Factors 


Business environment and internal control factors (BEICFs) should be taken into account 
when loss severity and loss frequency are estimated. These include the complexity of 
the business line, the technology used, the pace of change, the level of supervision, staff 
turnover rates, and so on. For example, factors influencing the estimates made for the 
rogue trader scenario might be the level of supervision of traders, the level of trade surveil- 
lance, and the strengths or weaknesses of the systems used by the middle and back office. 


23.4 The Standardized Measurement Approach 


One thing the Basel Committee is always looking for is consistency across banks in the 
application of its regulations. If different banks would calculate quite different capital 
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Table 23.2 Calculation of BI Component from BI 


BI (billions BI Component 

of euros) (millions of euros) 

0 to 1 110 x BI 

1 to 3 110 + 150 x (BI-1) 

3 to 10 410 + 190 x (BI-3) 

10 to 30 1,740 + 230 x (BI-10) 
>30 6,340 + 290 x (BI-30) 


charges from the same set of data, the Basel Committee feels that the underlying regula- 
tion is not working well. This was the situation with AMA. The Basel Committee hoped 
that the flexibility in AMA would ultimately lead to the emergence of best practice. In 
fact, the inherent complexity of AMA and the wide range of internal modeling practices 
led to a high variability in capital charges. 

In March 2016, the Basel Committee issued a consultative document indicating that 
it was going to replace all previous approaches for determining operational risk capi- 
tal with a new approach: the Standardized Measurement Approach (SMA)! Banks that 
invested heavily in implementing AMA were understandably annoyed and, as indicated 
earlier, it may well be the case that some national supervisors will allow them to continue 
using AMA. 

SMA is simpler than AMA. It first defines a quantity known as the Business Indicator 
(BI). This is a measure of bank size, similar to the Gross Income (GI) measure used in 
the Basel II basic indicator approach. Key differences are: 


1. Whereas a negative income (from, for example, trading) would reduce GI, it does 
not do so for BI. 

2. BI includes some items that produce operational risk but are omitted from GI. 

3. Operating expenses reduce GI, but because they can lead to operational risk they 
increase BI. 


The BI Component for a bank is calculated from its BI using the piecewise linear rela- 
tionship indicated in Table 23.2. The dollar increase in BI Component for a $1 increase 
in BI increases as the size of the bank increases. For small banks with BIs in the range 0 
to 1 billion euros, it is 0.11. It rises to 0.15 for banks with BIs in the 1 billion to 3 billion 
euro range, to 0.19 for banks with BIs in the 3 billion to 10 billion euro range, to 0.23 
for banks with BIs in the 10 billion to 30 billion euro range, and to 0.29 for banks with 
BIs above 30 billion euros. 


8 See Basel Committee on Banking Supervision, “Consultative Document: Standardized Approach for 
Operational Risk,’ March 2016. 
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Banks with BIs greater than one billion euros are required to calculate a Loss Com- 
ponent as follows: 


Loss Component = 7X +7Y +5Z 


where X is the average total annual loss from operational risk, Y is the average total 
annual loss including only losses greater than 10 million euros, and Z is the average total 
annual loss including only losses greater than 100 million euros. The averages should be 
over 10 years of good-quality loss data? The Loss Component and BI Component are 
calculated so that, according to Basel Committee estimates, Loss Component equals BI 
Component for an average bank. 

The Internal Loss Multiplier is calculated as 


ee. Loss Component 
Internal Loss Multiplier = In {| e — 1 + ————————— 
BI Component 


where e is the exponential constant (2.718). The calculations are designed so that the 
estimated Internal Loss Multiplier for an average bank is 1. Banks with a loss experience 
above (below) the industry average will have an Internal Loss Multiplier greater than (less 
than) 1. The lowest possible Internal Loss Multiplier is In(1.718) or 0.514. 

Finally, the SMA capital requirement can be calculated. For small banks whose BI is 
less than 1 billion euros, the SMA capital is set equal to the BI Component (so that these 
small banks are all regarded as “average”’). For other banks it is calculated (in millions of 
euros) as: 


SMA Capital = 110 + (BI Component — 110) * Internal Loss Multiplier 


23.5 Preventing Operational Risk Losses 


Up to now we have focused on the measurement of operational risk for the purposes 
of determining capital. A key part of an operational risk manager’s job is to be proactive 
and try to prevent losses before they occur. We now consider some of the ways this can 
be done. 

One approach is to monitor what is happening at other banks and try and learn 
from their mistakes. When a $700 million rogue trader loss happened at a Baltimore 
subsidiary of Allied Irish Bank in 2002, risk managers throughout the world studied the 
situation carefully and asked: “Could this happen to us?” Business Snapshot 23.1 describes 
a situation concerning a British local authority in the late 1980s. It immediately led to all 


° During a transition period, banks that do not have 10 years of good-quality loss data may use a 
minimum of five years of data. 
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BUSINESS SNAPSHOT 23.1 
The Hammersmith and Fulham Story 


Between 1987 and 1989, the London Borough of Hammersmith and Fulham 
in Great Britain entered into about 600 interest rate swaps and related transac- 
tions with a total notional principal of about six billion pounds. The transactions 
appear to have been entered into for speculative rather than hedging purposes. 
The two employees of Hammersmith and Fulham who were responsible for the 
trades had only a sketchy understanding of the risks they were taking and how 
the products they were trading worked. 

By 1989, because of movements in sterling interest rates, Hammersmith and 
Fulham had lost several hundred million pounds on the swaps. To the banks on 
the other side of the transactions, the swaps were worth several hundred million 
pounds. The banks were concerned about credit risk. They had entered into 
offsetting swaps to hedge their interest rate risks. If Hammersmith and Fulham 
defaulted, they would still have to honor their obligations on the offsetting swaps 
and would take a huge loss. 

What actually happened was not a default. Hammersmith and Fulham’s 
auditor asked to have the transactions declared void because Hammersmith and 
Fulham did not have the authority to enter into the transactions. The British 
courts agreed. The case was appealed and went all the way to the House of Lords, 
Britain’s highest court. The final decision was that Hammersmith and Fulham 
did not have the authority to enter into the swaps, but that they ought to have 
the authority to do so in the future for risk management purposes. Needless 
to say, banks were furious that their contracts were overturned in this way by 
the courts. 


banks instituting procedures for checking that counterparties had the authority to enter 


into derivatives transactions. 


23.5.1 Causal Relationships 


Operational risk managers should try to establish causal relations between decisions taken 
and operational risk losses. Does increasing the average educational qualifications of 
employees reduce losses arising from mistakes in the way transactions are processed? Will 
a new computer system reduce the probabilities of losses from system failures? Are oper- 
ational risk losses correlated with the employee turnover rate? Can the risk of a rogue 
trader be reduced by the way responsibilities are divided between different individuals 


and by the way traders are motivated? 
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One approach to establishing causal relationships is statistical. If we look at 12 dif- 
ferent locations where a bank operates and find a high negative correlation between the 
education of back-office employees and the cost of mistakes in processing transactions, it 
might well make sense to do a cost-benefit analysis of changing the educational require- 
ments for a back-office job in some of the locations. In some cases, a detailed analysis of 
the cause of losses may provide insights. For example, if 40% of computer failures can be 
attributed to the fact that the current hardware is several years old and less reliable than 
newer versions, a cost-benefit analysis of upgrading is likely to be useful. 


23.5.2 RCSA and KRIs 


Risk control and self-assessment (RCSA) is an important way in which banks try to 
achieve a better understanding of their operational risk exposures. It involves asking the 
managers of business units to identify their operational risks. Sometimes questionnaires 
and scorecards designed by senior management or consultants are used. 

A by-product of any program to measure and understand operational risk is likely to 
be the development of key risk indicators (KRIs).!° Risk indicators are key tools in the 
management of operational risk. The most important indicators are prospective. They 
provide an early warning system to track the level of operational risk in the organization. 
Examples of key risk indicators that could be appropriate in particular situations are 


. Staff turnover 

. Number of failed transactions 

. Number of positions filled by temps 
. Ratio of supervisors to staff 

. Number of open positions 


au & &N 


. Percentage of staff who did not take 10 days consecutive leave in the last 12 months 


The hope is that key risk indicators can identify potential problems and allow reme- 
dial action to be taken before losses are incurred. It is important for a bank to quantify 
operational risks, but it is even more important to take actions that control and manage 
those risks. 


23.5.3 E-Mails and Phone Calls 


One way in which operational risk costs can be mitigated is by educating employees to 
be very careful about what they write in e-mails and, when they work on the trading 
floor, what they say in phone calls. Lawsuits or regulatory investigations contending that 
a financial institution has behaved inappropriately or illegally are a major source of oper- 
ational risk. One of the first things that happens when a case is filed is “discovery,” where 
the financial institution is required to provide all relevant internal communications. These 


'OThese are sometimes referred to as Business Environment and Internal Control Factors (BEICFs). 
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have often proved to be very embarrassing and have made it difficult for financial insti- 
tutions to defend themselves. (Fabrice Tourre’s e-mail in Business Snapshot 27.3 is an 
example of an embarrassing e-mail.) Before sending an e-mail or making a recorded 
phone call, an employee should consider the question “How could it hurt my employer 
if this became public knowledge?” 


23.6 Allocation of Operational Risk Capital 


Operational risk capital should be allocated to business units in a way that encourages 
them to improve their operational risk management. Methods for doing this are discussed 
in Sections 12.8 and 26.6. If a business unit can show that it has taken steps to reduce 
the frequency or severity of a particular risk, it should be allocated less capital. This will 
have the effect of improving the business unit’s return on capital (and possibly lead to 
the business unit manager receiving an increased bonus). 

Note that it is not always optimal for a manager to reduce a particular operational 
risk. Sometimes the costs of reducing the risk outweigh the benefits of reduced capital so 
that return on allocated capital decreases. A business unit should be encouraged to make 
appropriate calculations and determine the amount of operational risk that maximizes 
return on capital. 

The overall result of operational risk assessment and operational risk capital allocation 
should be that business units become more sensitive to the need for managing operational 
risk. Hopefully, operational risk management will be seen to be an important part of 
every manager’s job. A key ingredient for the success of any operational risk program 
is the support of senior management. The Basel Committee on Banking Supervision is 
very much aware of this. It recommends that a bank’s board of directors be involved in the 
approval of a risk management program and that it review the program on a regular basis. 


23.7 Use of Power Law 


In Section 10.4, we introduced the power law. This states that, for a wide range of variables 
Prob(v > x) = Kx * 


where v is the value of the variable, x is a relatively large value of v, and K and @ are con- 
stants. We covered the theoretical underpinnings of the power law and maximum like- 
lihood estimation procedures when we looked at extreme value theory in Section 13.5. 

De Fontnouvelle et al. (2003), using data on losses from vendors, found that the power 
law holds well for the large losses experienced by banks.'! This makes the calculation 


'lSee P. de Fontnouvelle, V. DeJesus-Rueff, J. Jordan, and E. Rosengren, “Capital and Risk: New 
Evidence on Implications of Large Operational Risk Losses,” Journal of Money, Credit and Banking 38, 
no. 7 (October 2006): 1819-1846. 
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of VaR with high degrees of confidence, such as 99.9%, easier. Loss data (internal or 
external) and scenario analysis are employed to estimate the power law parameters using 
the maximum likelihood approach in Chapter 13. The 99.9 percentile of the loss distri- 
bution can then be estimated using equation (13.9). 

When loss distributions are aggregated, the distribution with the heaviest tails tends 
to dominate. This means that the loss with the lowest defines the extreme tails of the 
total loss distribution.'* Therefore, if all we are interested in is calculating the extreme 
tail of the total operational risk loss distribution, it may only be necessary to consider 
one or two business line/risk type combinations. 


23.8 Insurance 


An important decision for operational risk managers is the extent to which operational 
risks should be insured against. Insurance policies are available on many different kinds of 
risk ranging from fire losses to rogue trader losses. In Section 3.7 we discussed the moral 
hazard and adverse selection risks faced by insurance companies. We now review these 
again in the context of operational risk. 


23.8.1 Moral Hazard 


One of the risks facing a company that insures a bank against operational risk losses is 
moral hazard. This is the risk that the existence of the insurance contract will cause the 
bank to behave differently than it otherwise would. This changed behavior increases the 
risks to the insurance company. Consider, for example, a bank that insures itself against 
robberies. As a result of the insurance policy, it may be tempted to be lax in its imple- 
mentation of security measures—making a robbery more likely than it would otherwise 
have been. 

Insurance companies have traditionally dealt with moral hazard in a number of ways. 
Typically there is a deductible in any insurance policy. This means that the bank is respon- 
sible for bearing the first part of any loss. Sometimes there is a coinsurance provision in a 
policy. In this case, the insurance company pays a predetermined percentage (less than 
100%) of losses in excess of the deductible. In addition, there is nearly always a policy limit. 
This is a limit on the total liability of the insurer. Consider again a bank that has insured 
itself against robberies. The existence of deductibles, coinsurance provisions, and policy 
limits is likely to provide an incentive for a bank not to relax security measures in its 
branches. The moral hazard problem in rogue trader insurance is discussed in Business 
Snapshot 23.2. 


12The parameter € in extreme value theory (see Chapter 13) equals 1/a, so it is the loss distribution 
with the largest € that defines the extreme tails. 
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BUSINESS SNAPSHOT 23.2 
Rogue Trader Insurance 


A rogue trader insurance policy presents particularly tricky moral hazard prob- 
lems. An unscrupulous bank could enter into an insurance contract to protect 
itself against losses from rogue trader risk and then choose to be lax in its imple- 
mentation of trading limits. Ifa trader exceeds the trading limit and makes a large 
profit, the bank is better off than it would be otherwise. If a large loss results, a 
claim can be made under the rogue trader insurance policy. Deductibles, coin- 
surance provisions, and policy limits may mean that the amount recovered is less 
than the loss incurred by the trader. However, potential net losses to the bank are 
likely to be far less than potential profits, making the lax trading limits strategy a 
good bet for the bank. 

Given this problem, it is perhaps surprising that some insurance companies 
do offer rogue trader insurance policies. These companies tend to specify care- 
fully how trading limits are implemented within the bank. They may require that 
the existence of the insurance policy not be revealed to anyone on the trading 
floor. They are also likely to want to retain the right to investigate the circum- 
stances underlying any loss. 

From the bank’s point of view, the lax trading limits strategy we have outlined 
may be very short-sighted. The bank might well find that the costs of all types of 
insurance rise significantly as a result of a rogue trader claim. Also, a large rogue 
trader loss (even if insured) would cause its reputation to suffer. 


23.8.2 Adverse Selection 


The other major problem facing insurance companies is adverse selection. This is where 
an insurance company cannot distinguish between good and bad risks. It offers the same 
price to everyone and inadvertently attracts more of the bad risks. For example, banks 
without good internal controls are more likely to enter into rogue trader insurance con- 
tracts; banks without good internal controls are more likely to buy insurance policies to 
protect themselves against external fraud. 

To overcome the adverse selection problem, an insurance company must try to 
understand the controls that exist within banks and the losses that have been experi- 
enced. As a result of its initial assessment of risks, it may not charge the same premium 
for the same contract to all banks. As time goes by, it gains more information about the 
bank’s operational risk losses and may increase or reduce the premium charged. This is 
much the same as the approach adopted by insurance companies when they sell auto- 
mobile insurance to a driver. At the outset, the insurance company obtains as much 
information on the driver as possible. As time goes by, it collects more information on 
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the driver’s risk (number of accidents, number of speeding tickets, etc.) and modifies the 
premium charged accordingly. 


23.9 Sarbanes—Oxley 


Largely as a result of the Enron bankruptcy, the Sarbanes-Oxley Act was passed in the 
United States in 2002. This provides another dimension to operational risk management 
for both financial and nonfinancial institutions in the United States. The act requires 
boards of directors to become much more involved with day-to-day operations. They 
must monitor internal controls to ensure risks are being assessed and handled well. 

The act specifies rules concerning the composition of the board of directors of public 
companies and lists the responsibilities of the board. It gives the SEC the power to censure 
the board or give it additional responsibilities. A company’s auditors are not allowed 
to carry out any significant non-auditing services for the company.’ Audit partners 
must be rotated. The audit committee of the board must be made aware of alternative 
accounting treatments. The CEO and CFO must prepare a statement to accompany 
the audit report to the effect that the financial statements are accurate. The CEO and 
CFO are required to return bonuses in the event that financial statements are restated. 
Other rules concern insider trading, disclosure, personal loans to executives, reporting of 
transactions by directors, and the monitoring of internal controls by directors. 


Summary 


In 1999, bank supervisors indicated their intention to charge capital for operational risk 
as part of Basel II. This has led banks to carefully consider how they should measure and 
manage operational risk. Bank supervisors have identified seven different types of oper- 
ational risks and eight different business lines. The most sophisticated banks attempted 
to quantify risks for each of the 56 risk type/business line combinations. 

Operational risk losses of a particular type can in theory be treated in much the same 
way as actuaries treat losses from insurance policies. A frequency of loss distribution and 
a severity of loss distribution can be estimated and then combined to form a total loss 
distribution. The advanced measurement approach (AMA) in Basel II required banks 
to use internal data, external data, scenario analysis, and business environment and risk 
control factors to estimate these distributions. 

In 2016, the Basel Committee announced that AMA was not working as well as 
hoped and indicated that it was planning to require banks to use a new approach: the 
Standardized Measurement Approach (SMA). As part of SMA, the Basel Committee 
developed the Business Indicator, a measure of the size of a bank that is relevant to its 


'5Enron’s auditor, Arthur Andersen, provided a wide range of services in addition to auditing. The 
company did not survive the litigation that followed the downfall of Enron. 
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potential operational risk exposure. This is used in conjunction with the bank’s aver- 
age loss experience over the previous 10 years to determine operational risk regulatory 
capital. 

Part of the job of operational risk managers is to try to increase awareness among 
employees of operational risk so that losses are prevented. They should also try to under- 
stand what determines operational risk losses and develop key risk indicators to track the 
level of operational risk in different parts of the organization. 

Once operational risk capital has been estimated, it is important to develop proce- 
dures for allocating it to business units. This should be done in a way that encourages 
business units to reduce operational risk when this can be done without incurring exces- 
sive costs. 

The power law introduced in Chapter 10 seems to apply to operational risk losses. 
This makes it possible to use extreme value theory (see Chapter 13) to estimate the tails 
of a loss distribution from empirical data. When several loss distributions are aggregated, 
it is the loss distribution with the heaviest tail that dominates. In principle, this makes the 
calculation of VaR for total operational risk easier. 

Many operational risks can be insured against. However, most policies include 
deductibles, coinsurance provisions, and policy limits. As a result, a bank is always left 
bearing part of any risk itself. Moreover, the way insurance premiums change as time 
passes is likely to depend on the claims made and other indicators that the insurance 
company has of how well operational risks are being managed. 

The whole process of measuring, managing, and allocating operational risk is still in 
its infancy. As time goes by and data are accumulated, improvements to our procedures 
for managing operational risk are likely to emerge. One of the key problems is that there 
are two sorts of operational risk: high-frequency low-severity risks and low-frequency 
high-severity risks. The former are relatively easy to quantify, but the latter pose the 
biggest risks. 

Bank supervisors seem to have succeeded in their objective of making banks more 
sensitive to the importance of operational risk. In many ways, the key benefit of an opera- 
tional risk management program lies in making managers more aware of the importance 
of operational risk and perhaps leading to them thinking about it differently. 
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Practice Questions and Problems 
(Answers at End of Book) 


23.1 What risks are included by regulators in their definition of operational risks? What 
risks are not included? 

23.2 Suppose that external data show that a loss of $100 million occurred at a bank 
with annual revenues of $1 billion. Your bank has annual revenues of $3 billion. 
What is the implication of the external data for losses that could occur at your 
bank? (Use Shih’s result.) 

23.3 Suppose that there is a 90% probability that operational risk losses of a certain 
type will not exceed $20 million. The power law parameter, a, is 0.8. What is the 
probability of losses exceeding (a) $40 million, (b) $80 million, and (c) $200 mil- 
lion? 

23.4 Discuss how moral hazard and adverse selection are handled in car insurance. 

23.5 Give two ways Sarbanes—Oxley affects the CEOs of public companies. 

23.6 When is a trading loss classified as a market risk and when is it classified as an 
operational risk? 

23.7 Discuss whether there is (a) moral hazard and (b) adverse selection in life insurance 
contracts. 

23.8 What are external loss data? How are they obtained? How are they used in deter- 
mining operational risk loss distributions for a bank? 

23.9 What distributions are commonly used for loss frequency and loss severity? 

23.10 Give two examples of key risk indicators that might be monitored by a central 
operational risk management group within a bank. 

23.11 The worksheet used to produce Figure 23.2 is on the author’s website. What is 
the mean and standard deviation of the loss distribution? Modify the inputs to the 
simulation to test the effect of changing the loss frequency from three to four. 


536 


RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


23.12 


23.13 


23.14 


23.15 


23.16 


23.17 


What are the key inputs to determining regulatory capital under the Standardized 
Measurement Approach? 

Consider two banks, each with a BI component of 300 million euros. Bank A has 
suffered 10 operational risk losses of 20 million euros in the past 10 years. Bank B 
has suffered one 200 million euro loss in the past 10 years. What is the operational 
risk regulatory capital for each bank under SMA? 


Further Questions 


Suppose that there is a 1% probability that operational risk losses of a certain 
type exceed $10 million. Use the power law to estimate the 99.97% worst-case 
operational risk loss when the a parameter equals (a) 0.25, (b) 0.5, (c) 0.9, and 
(d) 1.0. 

Consider the following two events: (a) A bank loses $1 billion from an unexpected 
lawsuit relating to its transactions with a counterparty and (b) an insurance com- 
pany loses $1 billion because of an unexpected hurricane in Texas. Suppose that 
you have the same investment in shares issued by both the bank and the insurance 
company. Which loss are you more concerned about? Why? 

The worksheet used to produce Figure 23.2 is on the author’s website. How does 
the loss distribution change when the loss severity has a beta distribution with 
upper bound of five, lower bound of zero, and the other parameters both one? 
A bank has a BI of of 5.5 billion euros. It has had eight operational risk losses in 
the past 10 years. The amounts of the losses in millions of euros are: 3, 7, 15, 65, 
85, 150, 250, and 300. What is the bank’s operational risk regulatory capital under 
SMA? 


Chapter 24 


Liquidity Risk 


he credit crisis that started in the middle of 2007 has emphasized the impor- 

tance of liquidity risk for both financial institutions and their regulators. Some 

financial institutions that relied on wholesale deposits for their funding experi- 

enced problems as investors lost confidence in financial institutions. Moreover, financial 

institutions found that some instruments for which there had previously been a liquid 
market could only be sold at fire-sale prices during the crisis. 

It is important to distinguish solvency from liquidity. Solvency refers to a company 
having more assets than liabilities, so that the value ofits equity is positive. Liquidity refers 
to the ability of a company to make cash payments as they become due. Financial institu- 
tions that are solvent can—and sometimes do—fail because of liquidity problems. Con- 
sider a bank whose assets are mostly illiquid mortgages. Suppose the assets are financed 
90% with deposits and 10% with equity. The bank is comfortably solvent. But it could fail 
if there is a run on deposits with 25% of depositors suddenly deciding to withdraw their 
funds. In this chapter we will examine how Northern Rock, a British bank specializing 
in mortgage lending, failed largely because of liquidity problems of this type. 

It is clearly important for financial institutions to manage liquidity carefully. Liquidity 
needs are uncertain. A financial institution should assess a worst-case liquidity scenario 
and make sure that it can survive that scenario by either converting assets into cash or 
raising cash in some other way. The new Basel III requirements are designed to ensure 
that banks do this. 

Liquidity is also an important consideration in trading. A liquid position in an asset 
is one that can be unwound at short notice. As the market for an asset becomes less 
liquid, traders are more likely to take losses because they face bigger bid—offer spreads. 
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For an option or other derivative, it is important for there to be a liquid market for the 
underlying asset so that the trader has no difficulty in doing the daily trades necessary to 
maintain delta neutrality (see Chapter 8). 

This chapter discusses different aspects of liquidity risk. It considers liquidity trading 
risk and liquidity funding risk. It also looks at what are termed “liquidity black holes.” 
These are situations where a shock to financial markets causes liquidity to almost com- 
pletely dry up. 


24.1 Liquidity Trading Risk 


Ifa financial institution owns 100, 1,000, 10,000, or even 100,000 shares in IBM, liquidity 
risk is not a concern. Several million IBM shares trade on the New York Stock Exchange 
every day. The quoted price of the shares is very close to the price that the financial 
institution would be able to sell the shares for. However, not all assets are as readily 
convertible into cash. For example, a $100 million investment in the bonds of a non- 
investment-grade U.S. company might be quite difficult to sell at close to the market 
price in one day. Shares and debt of companies in emerging markets are likely to be even 
less easy to sell. 
The price at which a particular asset can be sold depends on 


1. The mid-market price of the asset, or an estimate of its value 
2. How much of the asset is to be sold 

3. How quickly it is to be sold 

4. The economic environment 


When there is a market maker who quotes a bid and offer price for a financial asset, 
the financial institution can sell the asset at the bid and buy at the offer. However, it is 
usually stated (or understood) that a particular quote is good for trades up to a certain 
size. Above that size, the market maker is likely to increase the bid-offer spread. This is 
because the market maker knows that, as the size of a trade increases, the difficulty of 
hedging the exposure created by the trade also increases. 

When there is no market maker for a financial instrument, there is still an implicit 
bid—offer spread. If a financial institution approaches another financial institution (or an 
interdealer broker) to do a trade, the price depends on which side of the trade it wants 
to take. The bid—offer spread for an asset can vary from 0.05% of the assets mid-market 
price to as much as 5%, or even 10%, of its mid-market price. 

The general nature of the relationship between bid quotes, offer quotes, and trade 
size is indicated in Figure 24.1. The bid price tends to decrease and the offer price tends 
to increase with the size of a trade. For an instrument where there is a market maker, the 
bids and offers are the same up to the market maker’s size limit and then start to diverge. 

Figure 24.1 describes the market for large deals between sophisticated financial insti- 
tutions. It is interesting to note that bid—offer spreads in the retail market sometimes show 
the opposite pattern to that in Figure 24.1. Consider, for example, an individual who 
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Offer price 


Quantity 


Figure 24.1 Bid and Offer Prices as a Function of Quantity Transacted 


approaches a branch of a bank wanting to do a foreign exchange transaction or invest 
money for 90 days. As the size of the transaction increases, the individual is likely to get 
a better quote. 

The price that can be realized for an asset often depends on how quickly it is to 
be liquidated and the economic environment. Suppose you want to sell your house. 
Sometimes the real estate market is referred to as a “seller’s market.” Almost as soon as 
you put the house on the market, you can expect to get several offers and the house will 
be sold within a week. In other markets, it may take six months or more to sell the house. 
In the latter case, if you need to sell the house immediately, you will have to reduce the 
asking price well below the estimated market value. 

Financial assets are similar to real assets as far as this is concerned. Sometimes liquidity 
is tight (e.g., after the Russian default of 1998 and after the subprime crisis of 2007—2008). 
Liquidating even a relatively small position can then be time-consuming and is sometimes 
impossible. On other occasions, there is plenty of liquidity in the market and relatively 
large positions can be unwound without difficulty. 

Liquidating a large position can be affected by what is termed predatory trading. This 
occurs when a market participant, say Company X, has a large position and other market 
participants guess that it will have to be unwound in the near future. The other market 
participants attempt to profit by doing similar trades to those they expect from Company 
X. For example, if it is expected that Company X will have to sell a large position in a 
particular stock, they short the stock in anticipation ofa price decline. This makes it more 
difficult than it would otherwise be for Company X to exit from its position at com- 
petitive prices. To avoid predatory trading, financial institutions emphasize to employees 
the importance of keeping their positions and their future trading plans confidential. 
Predatory trading was an issue for the trader known as the London Whale in 2012 (see 
Business Snapshot 25.1) and for Metallgesellschaft (see Business Snapshot 24.3). In the 


540 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


case of Long-Term Capital Management (see Business Snapshot 22.1), positions were 
unwound slowly over a period of time under the supervision of the Federal Reserve to 
avoid predatory trading. 

Another problem in the market for financial assets is that, when one financial institu- 
tion finds that it needs to unwind a position, it is often the case that many other financial 
institutions with similar positions need to do the same thing. The liquidity normally 
present in the market then evaporates. This is the “liquidity black hole” phenomenon 
that will be discussed later in this chapter. 


24.1.1 The Importance of Transparency 


One thing that the market has learned from the credit crisis of 2007 is that transparency 
is important for liquidity. If the nature of an asset is uncertain, it is not likely to trade in 
a liquid market for very long. 

As explained in Chapter 6, it had become common practice in the years prior to 
2007 to form portfolios of subprime mortgages and other assets and to create financial 
instruments by securitizing, re-securitizing, and re-re-securitizing the credit risk. Many 
of the financial instruments were even more complicated than indicated in Chapter 6. 
Sometimes the ABS CDOs that were created included non-mortgage assets and even 
tranches from other ABS CDOs. After August 2007, market participants realized that 
they knew very little about the risks in the instruments they had traded. Moreover, it was 
very difficult for them to find out very much about this. Belatedly, they realized they had 
been using credit ratings as a substitute for an understanding of the instruments. 

After August 2007, the instruments created from subprime mortgages became illiq- 
uid. Financial institutions had no idea how to mark to market investments that they had 
been scrambling to buy a few months earlier. They realized that they had purchased 
highly complicated credit derivatives and that they did not have the tools to value them. 
They lacked both the necessary models and solid information about the assets in the 
portfolios underlying the derivatives. 

Other well-defined credit derivatives, such as credit default swaps, continued to trade 
actively during the credit crisis. The lesson from all this is that the market can sometimes 
get carried away trading complex products that are not transparent, but, when it comes to 
its senses, liquidity for the products soon disappears. When the products do trade again, 
prices are likely to be low and bid-offer spreads are likely to be high. As mentioned in 
Chapter 6, in July 2008 Merrill Lynch agreed to sell $30.6 billion of ABS CDO tranches 
(previously rated AAA) to Lone Star Funds for 22 cents on the dollar. 


24.1.2 Measuring Market Liquidity 


One measure of the market liquidity of an asset is its bid—offer spread. This can be mea- 
sured either as a dollar amount or as a proportion of the asset price. The dollar bid—offer 
spread is 


p = Offer price — Bid price 
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The proportional bid—offer spread for an asset is defined as 


_ Offer price — Bid price 
= Mid-market price 


where the mid-market price is halfway between the bid and the offer price. Sometimes 
it is convenient to work with the dollar bid—offer spread, p, and sometimes with the 
proportional bid—offer spread, s. 

The mid-market price can be regarded as the fair price. In liquidating a position in 
an asset, a financial institution incurs a cost equal to sa/2 where æ is the dollar (mid- 
market) value of the position. This reflects the fact that trades are not done at the mid- 
market price. A buy trade is done at the offer price and a sell trade is done at the bid 
price. 

One measure of the liquidity of a book 1s how much it would cost to liquidate the 
book in normal market conditions within a certain time. Suppose that s; is an estimate 
of the proportional bid—offer spread in normal market conditions for the ith financial 
instrument held by a financial institution and q; is the dollar value of the position in the 
instrument. Then 


n sj; 
Cost of liquidation (normal market) = Dy oa (24.1) 
i=1 


where n is the number of positions. Note that although diversification reduces market 
risk, it does not necessarily reduce liquidity trading risk. However, as explained earlier, s; 
increases with the size of position i. Holding many small positions rather than a few large 
positions therefore tends to entail less liquidity risk. Setting limits to the size of any one 
position can therefore be one way of reducing liquidity trading risk. 


Example 24.1 


Suppose that a financial institution has bought 10 million shares of one company and 50 
million ounces of a commodity. The shares are bid $89.5, offer $90.5. The commodity 
is bid $15, offer $15.1. The mid-market value of the position in the shares is 90 X 10 = 
$900 million. The mid-market value of the position in the commodity is 15.05 X 50 = 
$752.50 million. The proportional bid—offer spread for the shares is 1/90 or 0.01111. 
The proportional bid—offer spread for the commodity is 0.1/15.05 or 0.006645. The 
cost of liquidation in a normal market is 


900 x 0.01111/2 + 752.5 x 0.006645/2 = 7.5 


or $7.5 million. 
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Another measure of liquidity is the cost of liquidation in stressed market conditions 
within a certain time period. Define p; and 6; as the mean and standard deviation of the 
proportional bid—offer spread for the ith financial instrument held. Then: 


á + ho.)\a. 
Cost of liquidation (stressed market) = 5 wr (24.2) 
i=1 


The parameter À gives the required confidence level for the spread. If, for example, we 
are interested in considering “worst case” spreads that are exceeded only 1% of the time, 
and if it is assumed that spreads are normally distributed, then A = 2.326. 


Example 24.2 


Suppose that in Example 24.1 the mean and standard deviation for the bid—offer spread 
for the shares are $1.0 and $2.0, respectively. Suppose further that the mean and stan- 
dard deviation for the bid—offer spread for the commodity are both $0.1. The mean 
and standard deviation for the proportional bid—offer spread for the shares are 0.01111 
and 0.02222, respectively. The mean and standard deviation for the proportional bid— 
offer spread for the commodity are both 0.006645. Assuming the spreads are normally 
distributed, the cost of liquidation that we are 99% confident will not be exceeded 
is 


0.5 X 900 x (0.01111 + 2.326 x 0.02222) 
+0.5 X 752.5 X (0.006645 + 2.326 x 0.006645) = 36.58 


or $36.58 million. This is almost five times the cost of liquidation in normal market 
conditions. 


In practice, bid—offer spreads are not normally distributed and it may be appropriate 
to use a value of A that reflects their empirical distribution. For example, if it is found 
that the 99 percentile point of the distribution is 3.6 standard deviations above the mean 
for a particular category of financial instruments, A can be set equal to 3.6 for those 
instruments. 

Equation (24.2) assumes that spreads in all instruments are perfectly correlated. This 
may seem overly conservative, but it is not. When liquidity is tight and bid—offer spreads 
widen, they tend to do so for all instruments. It makes sense for a financial institution 
to monitor changes in the liquidity of its book by calculating the measures in equa- 
tions (24.1) and (24.2) on a regular basis. As we have seen, the bid—offer spread depends 
on how quickly a position is to be liquidated. The measures in equations (24.1) and 
(24.2) are therefore likely to be decreasing functions of the time period assumed for the 
liquidation. 
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24.1.3 Liquidity-Adjusted VaR 


Market value at risk, which we discussed in Chapters 12 to 14, is designed to calculate an 
estimate of a “worst case” change in the mark-to-market valuation of the trading book. 
The measures in equations (24.1) and (24.2) are designed to calculate the cost of liqui- 
dating a book if market prices do not change. Although VaR and liquidity risk measures 
deal with different types of risks, some researchers have suggested combining them into 
a liquidity-adjusted VaR measure. One definition of liquidity-adjusted VaR is regular VaR 
plus the cost of unwinding positions in a normal market. From equation (24.1) this gives 


n 5. 
Liquidity-Adjusted VaR = VaR + )) = (24.3) 


i=1 


Alternatively it can be defined as regular VaR plus the cost of unwinding positions in a 
stressed market. From equation (24.2) this gives! 
+ Ao)a, 
Liquidity-Adjusted VaR = VaR + 2. Hitro 


i=1 


24.1.4 Unwinding a Position Optimally 


A trader wishing to unwind a large position in a financial instrument has to decide on 
the best trading strategy. If the position is unwound quickly, the trader will face large bid— 
offer spreads, but the potential loss from the mid-market price moving against the trader 
is small. If the trader chooses to take several days to unwind the position, the bid—offer 
spread the trader faces each day will be lower, but the potential loss from the mid-market 
price moving against the trader is larger. 

This type of problem is discussed by Almgren and Chriss (2001)? Suppose that the 
size of a position is V units and that a trader has to decide how to liquidate it over an 
n-day period. In this case, it is convenient to define the bid—offer spread in dollars rather 
than as a proportion. Define the dollar bid—offer spread when the trader trades q units 
in one day as p(q) dollars. Define q; as the units traded on day i and x; as the size of the 
trader’ position at the end of day i (1 < i < n). It follows that x, = x,_, — q,forl <i<n 
where xọ is defined as the initial position size, V. 

Each trade costs half the bid—offer spread. The total of the costs related to the bid— 
offer spread is therefore 


'This was suggested in A. Bangia, E Diebold, T. Schuermann, and J. Stroughair, “Liquidity on the 
Outside,” Risk 12 (June): 68-73. 

?See R. Almgren and N. Chriss, “Optimal Execution of Portfolio Transactions,” Journal of Risk 3 
(Winter 2001): 5-39. 
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Suppose that the mid-market price changes are normally distributed with a standard 
deviation of o per day and trading takes place at the beginning of a day. The variance 
of the change in the value of the trader’s position on day i is ox’. Assuming that price 
changes on successive days are independent, the variance of the change in the value of 
the position applicable to the unwind is therefore 


n 
Sots 
i=1 


A trader might reasonably wish to minimize VaR after trading costs have been con- 
sidered. This corresponds to minimizing something similar to the liquidity-adjusted VaR 
measure in equation (24.3). The trader’s objective is to choose the q; so that 


is minimized subject to 


Suey 
i=1 


with the x; being calculated from V and the q;, as indicated above. The parameter A mea- 


sures the confidence level in the VaR estimate. For example, when the confidence level 
is 99%, and daily price changes are assumed to be normally distributed, A = 2.326. Once 
the p(q) function has been estimated, Excel’s Solver can be used for the optimization. 


Example 24.3 


A trader wishes to unwind a position of 100 million units in an asset over five days. 
Suppose that the bid—offer spread p (measured in dollars) as a function of the daily trading 
volume is 


p(q) = at be 


where a = 0.1, b = 0.05, and c = 0.03 and the amount traded, q, is measured in millions 
of units. 

The standard deviation of the price change per day is $0.1. A spreadsheet for 
calculating the optimal strategy can be downloaded from the author’s website: www 
2.rotman.utoronto.ca/~hull/riskman. When the confidence level is 95%, the amounts 
that should be traded on days 1, 2, 3, 4, and 5 are 48.9, 30.0, 14.1, 5.1, and 1.9 million 
units, respectively. As the VaR confidence level is reduced, the amounts traded per day 
show less variability. For example, when the confidence level is 90%, they are 45.0, 29.1, 
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15.6, 7.0, and 3.3 million units, respectively. When the confidence level is 75% they are 
36.1, 26.2, 17.7,11.6, and 8.4 million units, respectively. In the limit when the confidence 
level is set equal to 50%, so that the trader is interested only in expected costs, not in the 
standard deviation of costs, 20 million units should be traded each day. 

As this example illustrates, when a position is to be closed out over n days, more than 
1/n of the position should be traded on the first day. This is because the longer any part 
of the position is held, the greater the risk of adverse market moves. 


24.1.5 Other Measures of Market Liquidity 


So far we have focused on bid—offer spread as a measure of market liquidity. Many other 
measures have been proposed. The volume of trading per day (i.e., the number of times 
the asset trades in a day) is an important measure. When an asset is highly illiquid, the 
volume of trading in a day is often zero. The price impact of a trade of a certain size 
is another measure. A measure somewhat similar to this, but more easily calculated, was 
proposed by Amihud (2002), It is the average of 


Absolute value of daily return 
Daily dollar volume 


over all days in the period considered. This measure is widely used by researchers. Amihud 
shows that an asset’s expected return increases as its liquidity decreases. In other words, 
investors do get compensated for illiquidity. 


24.2 Liquidity Funding Risk 


We now move on to consider liquidity funding risk. This is the financial institution’s 

ability to meet its cash needs as they arise. As mentioned at the outset of this chapter, 

liquidity is not the same as solvency. Financial institutions that are solvent (i.e., have 

positive equity) can, and sometimes do, fail because of liquidity problems. Northern 

Rock, a British mortgage lender, is a case in point (see Business Snapshot 24.1). 
Liquidity funding problems at a financial institution can be caused by: 


1. Liquidity stresses in the economy (e.g., a flight to quality such as that seen during 
the 2007 to 2009 crisis). Investors are then reluctant to provide funding in situations 
where there is any credit risk at all. 


3See Y. Amihud, “Illiquidity and Stock Returns: Cross-Section and Time-Series Effects,” Journal of 
Financial Markets 5 (2002): 31—56. 
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2. Overly aggressive funding decisions. There is a tendency for all financial institutions 
to use short-term instruments to fund long-term needs, creating a liquidity mis- 
match. Financial institutions need to ask themselves: “How much of a mismatch is 
too much?” 

3. A poor financial performance, leading to a lack of confidence. This can result in a 
loss of deposits and difficulties in rolling over funding. 


Often, when a company experiences severe liquidity problems, all three of these have 
occurred at the same time. The key to managing liquidity risk is predicting cash needs 
and ensuring that they can be met in adverse scenarios. Some cash needs are predictable. 
For example, if a bank has issued a bond, it knows when coupons will have to be paid. 
Others, such as those associated with withdrawals of deposits by retail customers and 
drawdowns by corporations on lines of credit that the bank has granted, are less pre- 
dictable. As the financial instruments entered into by financial institutions have become 
more complex, cash needs have become more difficult to predict. For example, down- 
grade triggers (see Section 20.2.3), guarantees provided by a financial institution, and 
possible defaults by counterparties in derivatives transactions can have an unexpected 
impact on cash resources. 


24.2.1 Sources of Liquidity 


The main sources of liquidity for a financial institution are: 


. Holdings of cash and Treasury securities 
. The ability to liquidate trading book positions 
. The ability to borrow money at short notice 


A U Ne 


. The ability to offer favorable terms to attract retail and wholesale deposits at short 
notice 


vi 


. The ability to securitize assets (such as loans) at short notice 
6. Borrowings from the central bank 


We now consider each of these in turn. 


Cash and Treasury Securities Cash and Treasury securities are excellent sources of liquidity. 
Cash is of course always available to meet liquidity needs and Treasury securities issued by 
countries such as the United States and the United Kingdom can generally be converted 
into cash at short notice without any problem. However, cash and Treasury securities are 
relatively expensive sources of liquidity. There is a trade-off between the liquidity of an 
asset and the return it provides. In order to be profitable, a financial institution needs to 
invest in assets such as loans to corporations that provide a higher rate of return than 
Treasury instruments. There is therefore a limit to the cash and Treasury securities that 
can reasonably be held. 
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BUSINESS SNAPSHOT 24.1 
Northern Rock 


Northern Rock, a British bank, was founded in 1997 when the Northern Rock 
Building Society floated shares on the London Stock Exchange. In 2007, it was 
one of the top five mortgage lenders in the United Kingdom. It had 76 branches 
and offered deposit accounts, savings accounts, loans, and house/contents insur- 
ance. The bank grew rapidly between 1997 and 2007. Some of its mortgages were 
securitized through a subsidiary, Granite, that was based in the Channel Islands. 

Northern Rock relied on selling short-term debt instruments for much of 
its funding. Following the subprime crisis of August 2007, the bank found it very 
difficult to replace maturing instruments because institutional investors became 
very nervous about lending to banks that were heavily involved in the mortgage 
business. The bank’s assets were sufficient to cover its liabilities so it was not 
insolvent. To quote from the Financial Services Authority (FSA) in September 
2007: “The FSA judges that Northern Rock is solvent, exceeds its regulatory 
capital requirement, and has a good quality loan book.” But Northern Rock’s 
inability to fund itself was a serious problem. It approached the Bank of England 
for funding on September 12, 2007, and borrowed about £3 billion from the 
Tripartite Authority (Bank of England, the Financial Services Authority, and 
HM Treasury) in the following few days. 

On September 13, 2007, the BBC business editor Robert Peston broke the 
news that the bank had requested emergency support from the Bank of England. 
On Friday, September 14, there was a run on the bank. Thousands of people lined 
up for hours to withdraw their funds. This was the first run on a British bank in 
150 years. Some customers held their funds in an “Internet-only” account, which 
they were unable to access due to the volume of customers trying to log on. On 
Monday, September 17, worried savers continued to withdraw their funds. An 
estimated £2 billion was withdrawn between September 12 and September 17, 
2007. 

Depositor insurance in the UK guaranteed 100% of the first £2,000 and 
90% of the next £33,000. Late on September 17, 2007, the British Chancellor of 
the Exchequer, Alistair Darling, announced that the British government and the 
Bank of England would guarantee all deposits held at Northern Rock. As a result 
of this announcement and subsequent advertisements in major UK newspapers, 
the lines outside Northern Rock’s branches gradually disappeared. Northern 
Rock’s shares, which had fallen from £12 earlier in the year to £2.67, rose 16% 
on Mr. Darling’s announcement. 

During the months following September 12, 2007, Northern Rock’s emer- 
gency borrowing requirement increased. The Bank of England insisted on a 
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penalty rate of interest to discourage other banks from taking excessive risks. 
Northern Rock raised some funds by selling assets, but by February 2008 the 
emergency borrowing reached £25 billion. The bank was then nationalized and 
the management of the bank was changed. It was split into Northern Rock 
ple and Northern Rock (Asset Management), with the company’s bad debt 
being put in Northern Rock (Asset Management). In November 2011, North- 
ern Rock plc was bought from the British government for £747 million by the 
Virgin Group, which is headed by the colorful entrepreneur Sir Richard Branson. 

The Northern Rock story illustrates just how quickly liquidity problems 
can lead to a bank spiraling downward. If the bank had been managed a little 
more conservatively and had paid more attention to ensuring that it had access 
to funding, it might have survived. 


Liquidating Tiading Book Positions Liquidity funding risk is related to liquidity trading risk, 
considered in Section 24.1, because one way a financial institution can meet its funding 
requirements is by liquidating part of its trading book. It is therefore important for a 
financial institution to quantify the liquidity of its trading book so that it knows how 
easy it would be to use the book to raise cash. The financial institution wants to make 
sure that it will be able to survive stressed market conditions where there is a general 
shortage of liquidity. The financial institution’s analysis should therefore be based on 
stressed market conditions, not normal market conditions. This is the reason why the 
measures discussed in the previous section concerned with bid—offer spreads in stressed 
markets are important. 


Ability to Borrow When markets are unstressed, a creditworthy bank usually has no 
problem in borrowing money, but in stressed market conditions there is a heightened 
aversion to risk. This leads to higher interest rates, shorter maturities for loans, and in 
some cases a refusal to provide funds at all. Financial institutions should monitor the 
assets that can be pledged as collateral for loans at short notice. A financial institu- 
tion can (at a cost) mitigate its funding risks somewhat by arranging lines of credit. 
For example, Countrywide, an originator of mortgages in the United States, had a 
syndicated loan facility of $11.5 billion, which it was able to use during the credit 
crisis of 2007. (This helped keep the company afloat, but it still experienced signifi- 
cant problems and was taken over by Bank of America in January 2008.) As Business 
Snapshot 24.1 shows, Northern Rock, a similar British mortgage lender, did not fare 
so well. 


Wholesale and Retail Deposits Wholesale deposits are a more volatile source of funding 
than retail deposits and can disappear quickly in stressed market conditions. Even retail 
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deposits are not as stable as they used to be because it is very easy to compare interest 
rates oftered by different financial institutions and make transfers via the Internet. Unfor- 
tunately, liquidity problems tend to be market-wide rather than something that affects 
one or two financial institutions. When one financial institution wants to increase its 
retail or wholesale deposit base for liquidity reasons by offering more attractive rates of 
interest, others usually want to do the same thing and the increased funding is likely to 
be difficult to achieve. 


Securitization As mentioned in Chapter 2, banks have found the “originate-to-distribute” 
model attractive. Rather than keep illiquid assets such as loans on their balance sheet, they 
have securitized them. The structures developed for doing this were discussed in Chapter 
6. Prior to August 2007, securitization was an important source of liquidity for banks. 
However, this source of liquidity dried up almost overnight in August 2007 as investors 
decided that the securitized products were too risky. “Originate-to-distribute” was no 
longer a viable strategy, and banks had to fund their loans. Not surprisingly, banks became 
a lot less willing to lend. 

Securitization led to other liquidity problems in August 2007. Banks had entered 
into liquidity backstop arrangements on the asset-backed commercial paper (ABCP) 
that was used to fund debt instruments, such as mortgages, prior to their securitization. 
When buyers could not be found, they had to buy the instruments themselves. In some 
cases, in order to avoid their reputations being adversely impacted, they had to provide 
financial support to conduits and other off-balance-sheet vehicles that were involved in 
securitization, even though they were not legally required to do so. 


Central Bank Borrowing Central banks (e.g., the Federal Reserve Board in the United 
States, the Bank of England in the UK, or the European Central Bank) are often referred 
to as “lenders of last resort”? When commercial banks are experiencing financial diffi- 
culties, central banks are prepared to lend money to maintain the health of the financial 
system. Collateral has to be posted by the borrowers and the central bank typically applies 
a haircut (i.e., it lends less than 100% of the value of the collateral) and may charge a 
relatively high rate of interest. In March 2008, after the failure of Bear Stearns (which 
was taken over by JPMorgan Chase), the Federal Reserve Board extended its borrowing 
facility to investment banks as well as commercial banks.’ Later, it also made the facility 
available to Fannie Mae and Freddie Mac (which were taken over by the government in 
September 2008). 


*Central banks are concerned about the failure of investment banks because of systemic risk (see 
Business Snapshot 15.1). Investment banks have derivatives contracts with other investment banks and 
with commercial banks. There is a danger that, because of the huge amount of trading between banks, 
a failure by an investment bank will have a ripple effect throughout the financial sector leading to a 
failure by commercial banks. 
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BUSINESS SNAPSHOT 24.2 
Ashanti Goldfields 


Ashanti Goldfields, a West African gold-mining company based in Ghana, expe- 
rienced problems resulting from its hedging program in 1999. It had sought to 
protect its shareholders from gold price declines by selling gold forward. On 
September 26, 1999, 15 European central banks surprised the market with an 
announcement that they would limit their gold sales over the following five years. 
The price of gold jumped up over 25%. Ashanti was unable to meet margin calls 
and this resulted in a major restructuring, which included the sale of a mine, a 
dilution of the interest of its equity shareholders, and a restructuring of its hedge 
positions. 


Different central banks apply different rules. Following the credit crisis of August 
2007, the haircuts used by the European Central Bank (ECB) were lower than those 
of other central banks. As a result, some British banks preferred to borrow from the 
European Central Bank (ECB) rather than the Bank of England. (There are even stories 
of North American banks contemplating the setting up of subsidiaries in Ireland to access 
the ECB.) By September 2008, the ECB had lent 467 billion euros and announced that 
it would apply larger haircuts in the future. 

Banks try to keep their borrowing from a central bank a secret. There is a danger that 
the use of central bank borrowings will be interpreted by the market as a sign that the 
bank is experiencing financial difficulties with the result that other sources of liquidity dry 
up. As Business Snapshot 24.1 discusses, news that Northern Rock required emergency 
borrowing led to an immediate run on the bank, exacerbating its liquidity problems. 


Hedging Issues Liquidity problems are liable to arise when companies hedge illiquid assets 
with contracts that are subject to margin requirements. As indicated in Business Snap- 
shot 8.1, gold-mining companies often hedge their risks by entering into agreements 
with financial institutions to sell gold forward for two or three years. Often the gold- 
mining company is required to post margin and the amount of the margin required is 
calculated every day to reflect the value of its forward agreements. If the price of gold 
rises fast, the forward agreements lose money and result in big margin calls being made by 
the financial institution on the gold-mining company. The losses on the forward agree- 
ments are offset by increases in the value of the gold in the ground—but this is an illiquid 
asset. As discussed in Business Snapshot 24.2, Ashanti Goldfields was forced to undertake 
a major restructuring when it could not meet margin calls after a sharp rise in the price 


of gold. 
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BUSINESS SNAPSHOT 24.3 
Metallgesellschaft 


In the early 1990s, Metallgesellschaft (MG) sold a huge volume of 5- to 10-year 
heating oil and gasoline fixed-price supply contracts to its customers at six to 
eight cents above market prices. It hedged its exposure with long positions in 
short-dated futures contracts that were rolled forward. As it turned out, the price 
of oil fell and there were margin calls on the futures positions. MG’s trading was 
made more difficult by the fact that its trades were very large and were anticipated 
by others. 

Considerable short-term cash flow pressures were placed on MG. The mem- 
bers of MG who devised the hedging strategy argued that these short-term cash 
outflows were offset by positive cash flows that would ultimately be realized on 
the long-term fixed-price contracts. However, the company’s senior manage- 
ment and its bankers became concerned about the huge cash drain. As a result, 
the company closed out all the hedge positions and agreed with its customers 
that the fixed-price contracts would be abandoned. The outcome was a loss to 
MG of $1.33 billion. 


Another extreme example of a liquidity funding problem caused by hedging is pro- 
vided by a German company, Metallgesellschaft, that entered into profitable fixed-price 
oil and gas contracts with its customers (see Business Snapshot 24.3). The lesson from the 
Ashanti and Metallgesellschaft episodes is not that companies should not use forward and 
futures contracts for hedging, but rather that they should ensure that they have access to 
funding to handle the cash flow mismatches that might arise in extreme circumstances. 


24.2.2 Reserve Requirements 


In some countries there are reserve requirements that require banks to keep a certain per- 
centage of deposits as cash in the bank’s vault or on deposit with the central bank. The 
reserve requirement applies only to transaction deposits (in essence, those made to a 
checking account). For large banks in the United States, the reserve requirement is cur- 
rently about 10%. Some countries, such as Canada and the United Kingdom, have no 
compulsory reserve requirements. Others have higher compulsory reserve requirements 
than the United States. 

In addition to ensuring that banks keep a minimum amount of liquidity, reserve 
requirements affect the money supply. When the reserve requirement is 10%, a $100 
deposit leads to $90 of lending, which leads to a further $90 of deposits in the banking 
system, which leads to further $81 of lending, and so on. As this process continues, the 
total money supply (M1) that is created is 90 + 81+ 72.9 +... or $900. If the reserve 
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requirement is 20%, a $100 deposit leads to $80 of lending, which leads to $64 of lending, 
and so on. The total increase in the money supply is 80 + 64 + 51.2 +... or $400. Most 
countries do not use the reserve requirement as a way of controlling the money supply. 
An exception appears to be China, where the reserve requirement is changed frequently. 


24.2.3 Regulation 


As explained in Chapter 16, Basel III introduced two liquidity risk requirements: the 
liquidity coverage ratio (LCR) and the net stable funding ratio (NSFR). 
The LCR requirement is 


High-quality liquid t 
gh-qua oe quid assets > 100% 
Net cash outflows in a 30-day period 


The 30-day period considered in the calculation of LCR is one of acute stress involving a 
downgrade of three notches (e.g., from AA+ to A+), a partial loss of deposits, a complete 
loss of wholesale funding, increased haircuts on secured funding, and drawdowns on lines 
of credit. LCR was implemented in stages between 2015 and 2019. (The required ratio 
was 60% in 2015 and 100% in 2019.) 

The NSFR requirement is 


Amount of stable funding 
— MlM > 100% 
Required amount of stable funding 


The numerator is calculated by multiplying each category of funding (capital, wholesale 
deposits, retail deposits, etc.) by an available stable funding (ASF) factor, reflecting their 
stability (see Table 16.4). The denominator is calculated from the assets and off-balance- 
sheet items requiring funding. Each category of these is multiplied by a required stable 
funding (RSF) factor to reflect the permanence of the funding (see Table 16.5). The 
implementation date for the NSFR requirement is January 1, 2018. 

Following the liquidity crisis of 2007, bank regulators issued a revised set of principles 
on how banks should manage liquidity” These are as follows: 


1. A bank is responsible for the sound management of liquidity risk. A bank should 
establish a robust liquidity risk management framework that ensures it maintains suf- 
ficient liquidity, including a cushion of unencumbered, high-quality liquid assets, to 
withstand a range of stress events, including those involving the loss or impairment of 
both unsecured and secured funding sources. Supervisors should assess the adequacy 
of both a bank’s liquidity risk management framework and its liquidity position and 
should take prompt action if a bank is deficient in either area in order to protect 
depositors and to limit potential damage to the financial system. 


>See Bank for International Settlements, “Principles for Sound Liquidity Risk Management and 
Supervision,’ September 2008. 
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2. 


3. 


10. 


A bank should clearly articulate a liquidity risk tolerance that is appropriate for its 
business strategy and its role in the financial system. 

Senior management should develop a strategy, policies, and practices to manage liq- 
uidity risk in accordance with the risk tolerance and to ensure that the bank maintains 
sufficient liquidity. Senior management should continuously review information on 
the bank’s liquidity developments and report to the board of directors on a regular 
basis. A bank’s board of directors should review and approve the strategy, policies, 
and practices related to the management of liquidity at least annually and ensure that 
senior management manages liquidity risk effectively. 


. A bank should incorporate liquidity costs, benefits, and risks in the internal pric- 


ing, performance measurement, and new product approval process for all significant 
business activities (both on- and off-balance-sheet), thereby aligning the risk-taking 
incentives of individual business lines with the liquidity risk exposures their activities 
create for the bank as a whole. 


. A bank should have a sound process for identifying, measuring, monitoring, and 


controlling liquidity risk. This process should include a robust framework for com- 
prehensively projecting cash flows arising from assets, liabilities, and off-balance-sheet 
items over an appropriate set of time horizons. 


. A bank should actively monitor and control liquidity risk exposures and fund- 


ing needs within and across legal entities, business lines, and currencies, taking 
into account legal, regulatory, and operational limitations to the transferability of 


liquidity. 


. A bank should establish a funding strategy that provides effective diversification in the 


sources and tenor of funding. It should maintain an ongoing presence in its chosen 
funding markets and strong relationships with funds providers to promote effective 
diversification of funding sources. A bank should regularly gauge its capacity to raise 
funds quickly from each source. It should identify the main factors that affect its 
ability to raise funds and monitor those factors closely to ensure that estimates of 
fund-raising capacity remain valid. 


. A bank should actively manage its intraday liquidity positions and risks to meet 


payment and settlement obligations on a timely basis under both normal and stressed 
conditions and thus contribute to the smooth functioning of payment and settlement 
systems. 


. A bank should actively manage its collateral positions, differentiating between 


encumbered and unencumbered assets. A bank should monitor the legal entity and 
physical location where collateral is held and how it may be mobilized in a timely 
manner. 

A bank should conduct stress tests on a regular basis for a variety of short-term and 
protracted institution-specific and market-wide stress scenarios (individually and in 
combination) to identify sources of potential liquidity strain and to ensure that cur- 
rent exposures remain in accordance with a bank’s established liquidity risk tolerance. 
A bank should use stress test outcomes to adjust its liquidity risk management strate- 
gies, policies, and positions and to develop effective contingency plans. 
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


A bank should have a formal contingency funding plan (CFP) that clearly sets out the 
strategies for addressing liquidity shortfalls in emergency situations. A CFP should 
outline policies to manage a range of stress environments, establish clear lines of 
responsibility, include clear invocation and escalation procedures, and be regularly 
tested and updated to ensure that it is operationally robust. 

A bank should maintain a cushion of unencumbered, high-quality liquid assets to 
be held as insurance against a range of liquidity stress scenarios, including those that 
involve the loss or impairment of unsecured and typically available secured funding 
sources. There should be no legal, regulatory, or operational impediment to using 
these assets to obtain funding. 

A bank should publicly disclose information on a regular basis that enables market 
participants to make an informed judgment about the soundness of its liquidity risk 
management framework and liquidity position. 


Recommendations for banks supervisors are: 


Supervisors should regularly perform a comprehensive assessment of a bank’s overall 
liquidity risk management framework and liquidity position to determine whether 
they deliver an adequate level of resilience to liquidity stress given the bank’s role in 
the financial system. 

Supervisors should supplement their regular assessments of a bank’s liquidity risk 
management framework and liquidity position by monitoring a combination of 
internal reports, prudential reports, and market information. 

Supervisors should intervene to require effective and timely remedial action by a 
bank to address deficiencies in its liquidity risk management processes or liquidity 
position. 

Supervisors should communicate with other supervisors and public authorities, such 
as central banks, both within and across national borders, to facilitate effective cooper- 
ation regarding the supervision and oversight of liquidity risk management. Commu- 
nication should occur regularly during normal times, with the nature and frequency 
of the information sharing increasing as appropriate during times of stress. 


24.3 Liquidity Black Holes 


It is sometimes argued that technological and other developments have led to an improve- 


ment in the liquidity of financial markets. This is questionable. It is true that bid—offer 


spreads have on average declined. But there has also been an increasing tendency for 


situations to develop where almost everyone wants to do the same type of trade at the 


same time. The result has been that what are referred to as “liquidity black holes” occur 


with increasing frequency A liquidity black hole describes a situation where liquidity 


®See A. D. Persaud, ed., Liquidity Black Holes: Understanding, Quantifying and Managing Financial Liquidity 
Risk (London: Risk Books, 1999). 
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has dried up in a particular market because everyone wants to sell and no one wants to 
buy, or vice versa. It is sometimes also referred to as a “crowded exit.” 

In a well-functioning market, the market may change its opinion about the price 
of an asset because of new information. However, the price does not overreact. If a 
price decrease is too great, traders will quickly move in and buy the asset and a new 
equilibrium price will be established. A liquidity black hole is created when a price 
decline causes more market participants to want to sell, driving prices well below where 
they will eventually settle. During the sell-off, liquidity dries up and the asset can be sold 


only at a fire-sale price’ 


24.3.1 Positive and Negative Feedback Traders 


Changes in the liquidity of financial markets are driven by the behavior of traders. There 
are two sorts of traders in the market: negative feedback traders and positive feedback 
traders.” Negative feedback traders buy when prices fall and sell when prices rise; positive 
feedback traders sell when prices fall and buy when prices rise. 

In liquid markets, negative feedback traders dominate the trading. If the price of an 
asset gets unreasonably low, traders will move in and buy. This creates demand for the 
asset that restores the price to a more reasonable level. Similarly, if the price of an asset 
gets unreasonably high, traders will sell. This creates supply of the asset that also restores 
the price to a more reasonable level. The result is that the market is liquid with reasonable 
prices and a good balance of buyers and sellers. 

When positive feedback traders dominate the trading, market prices are liable to be 
unstable and the market may become one-sided and illiquid. A reduction in the price of 
an asset causes traders to sell. This results in prices falling further and more selling. An 
increase in the price of an asset causes traders to buy. This results in the price of the asset 
increasing further and more buying. 

There are a number of reasons why positive feedback trading exists. For example: 


1. Trend trading. Trend traders attempt to identify trends in an asset price. They buy when 
the asset price appears to be trending up and sell when it appears to be trending 
down. A related strategy is breakout trading, which involves trading when an asset’s 
price moves outside an established range. For example, if a stock has been trading in 
the range of $25 to $30 for the last six months, traders might be inclined to buy it as 
soon as it moves above $30 and sell it as soon as it moves below $25. 

2. Stop-loss rules. Traders often have rules to limit their losses. When the price of an 
asset that is owned falls below a certain level, they automatically sell to limit their 


7See, for example, J. Clunie, Predatory Trading and Crowded Exits: New Thinking on Market Volatility 
(Petersfield, UK: Harriman House, 2010). 

8 Liquidity black holes tend to be associated with price decreases, but it is in theory also possible for 
them to occur when there are price increases. 

? This is a simplification of reality to help understand the dynamics of markets. Some traders follow 
complicated strategies that cannot be classified as positive feedback or negative feedback. 
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losses. These rules are known as “‘stop-loss” rules and are a source of positive feedback 
trading that is always present in the market. 


. Dynamic hedging. Chapter 8 explains how options traders maintain a delta-neutral 


position using dynamic hedging. In particular, Tables 8.2 and 8.3 show how a trader 
would hedge a short position in an option over a 20-week period. Hedging a short 
option position (call or put) involves buying after a price rise and selling after a price 
decline. This is positive feedback trading that has the potential to reduce liquidity. 
(By contrast, dynamically hedging a long position in a call or put option involves 
selling after a price rise and buying after a price decline. This is negative feedback 
trading and should not interfere with market liquidity.) Any situation where banks 
have a large short option position has the potential to destabilize the market and 
cause illiquidity. As discussed in Business Snapshot 3.1, at one point banks sold a 
huge volume of options on long-term interest rates to British insurance companies. 
As the banks hedged their risks, the behavior of long-term interest rates in the UK 
was dramatically affected. 


. Creating options synthetically. Hedging a short position in an option is equivalent to 


creating a long position in the same option synthetically. It follows that a financial 
institution can create a long option position synthetically by doing the same sort of 
trading as it would do if it were hedging a short option position. This leads to positive 
feedback trading that can cause market instability and illiquidity. The classic example 
here is the stock market crash of October 1987. In the period leading up to the crash, 
the stock market had done very well. Increasing numbers of portfolio managers were 
using commercially available programs to synthetically create put options on their 
portfolios. These programs told them to sell part of their portfolio immediately after 
a price decline and buy it back immediately after a price increase. The result, as 
indicated in Business Snapshot 24.4, was a liquidity black hole where prices plunged 
on October 19, 1987. In this case, the liquidity black hole was relatively short-lived. 
Within four months the market recovered to close to its pre-crash level. 


. Margins. A big movement in market variables, particularly for traders who are highly 


leveraged, may lead to margin calls that cannot be met. This forces traders to close 
out their positions, which reinforces the underlying move in the market variables. It 
is likely that volatility increases. This may exacerbate the situation because it leads to 
exchanges increasing their margin requirements. 


. Predatory trading. This was mentioned in Section 24.1. If traders know that an investor 


must sell large quantities of a certain asset, they know that the price of the asset is 
likely to decrease. They therefore short the asset. This reinforces the price decline 
and results in the price falling even further than it would otherwise do. To avoid 
predatory trading, large positions must usually be unwound slowly. 


. LTCM. The failure of the hedge fund Long-Term Capital Management (LTCM), as 


outlined in Business Snapshot 22.1, provides an example of positive feedback trading. 
One type of LT'CM’s trade was “relative value fixed income.” LTCM would take a 
short position in a liquid bond and a long position in a similar illiquid bond, and wait 
for the prices to move close together. After the Russian default in 1998, the prices of 
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BUSINESS SNAPSHOT 24.4 
The Crash of 1987 


On Monday, October 19, 1987, the Dow Jones Industrial Average dropped by 
more than 20%. Portfolio insurance played a major role in this crash. In October 
1987, portfolios involving over $60 billion of equity assets were being managed 
with trading rules that were designed to synthetically create put options on the 
portfolios. These trading rules involved selling equities (or selling index futures) 
when the market declined and buying equities (or buying equity futures) when 
the market rose. 

During the period Wednesday, October 14, 1987, to Friday, October 16, 
1987, the market declined by about 10%, with much of this decline taking 
place on Friday afternoon. The portfolio insurance rules should have generated 
at least $12 billion of equity or index futures sales as a result of this decline. 
In fact, portfolio insurers had time to sell only $4 billion and they approached 
the following week with huge amounts of selling already dictated by their 
models. It is estimated that on Monday, October 19, sell programs by three 
portfolio insurers accounted for almost 10% of the sales on the New York 
Stock Exchange, and that portfolio insurance sales amounted to 21.3% of all 
sales in index futures markets. It is likely that the decline in equity prices was 
exacerbated by investors other than portfolio insurers selling heavily because 
they anticipated the actions of portfolio insurers. 

Because the market declined so fast and the stock exchange systems were 
overloaded, many portfolio insurers were unable to execute the trades generated 
by their models and failed to obtain the protection they required. Needless 
to say, the popularity of portfolio insurance schemes has declined significantly 
since 1987. One of the lessons from this story is that it is dangerous to follow a 
particular trading strategy—even a hedging strategy—when many other market 
participants are doing the same thing. To quote from a report on the crash, 
“Liquidity sufficient to absorb the limited selling demands of investors became 
an illusion of liquidity by massive selling, as everyone showed up on the same 
side of the market at once. Ironically, it was this illusion of liquidity which 
led certain similarly motivated investors, such as portfolio insurers, to adopt 


strategies which call for liquidity far in excess of what the market could supply.” 


illiquid instruments declined relative to similar liquid instruments. LTCM (and other 


companies that were following similar strategies to LTCM) were highly leveraged 


and unable to meet margin calls. They were forced to close out their positions. This 


involved buying the liquid bonds and selling the illiquid bonds. This reinforced the 


flight to quality and made the prices of illiquid and liquid bonds diverge even further. 
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Investors allowed to increase leverage. 
They buy more assets. 


Asset prices increase. 
Leverage of investors decreases. 


Figure 24.2 Leveraging 


24.3.2 Leveraging and Deleveraging 


A phenomenon in the market is leveraging and deleveraging. This is illustrated in Figures 
24.2 and 24.3. When banks are awash with liquidity (e.g., because they have developed 
ways of securitizing assets or because deposit levels are higher than usual), they make 
credit easily available to businesses, investors, and consumers. Credit spreads decrease. 
The easy availability of credit increases demand for both financial and nonfinancial assets 
and the prices of these assets rise. Assets are often pledged as collateral for the loans that 
are used to finance them. When the prices of the assets rise, the collateral underlying 
loans (when measured at market prices) is greater and borrowing can increase further. 
This leads to further asset purchases and a repeat of the cycle. This cycle is referred to as 
“leveraging” because it leads to more borrowing throughout the economy. 
Deleveraging is the opposite process to leveraging. Banks find themselves less liquid 
for some reason (e.g., because there is less demand for the products of securitization). 
They become more reluctant to lend money. Credit spreads increase. There is less demand 


for both nonfinancial and financial assets and their prices decrease. The value of the 


Investors required to reduce leverage. 
Investors do this by selling assets. 


Asset prices decline. 
Leverage of investors increases. 


Figure 24.3 Deleveraging 
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collateral supporting loans decreases and banks reduce lines of credit. This leads to asset 
sales being necessary and a further reduction in asset prices. 

The period leading up to 2007 was characterized by leveraging for many of 
the world’s economies. Credit spreads declined and it was relatively easy to borrow 
money for a wide range of different purposes. From the middle of 2007 onward, 
the situation changed and the deleveraging process shown in Figure 24.3 started. 
Credit spreads increased, it became much less easy to borrow money, and asset prices 
decreased. 

Hedge funds are particularly affected by the leveraging—deleveraging cycle. Consider 
a hedge fund that is able to borrow 20 times its equity during the pre-2007 period. Soon 
after the middle of 2007, the hedge fund might get a call from its prime broker telling 
it to reduce leverage to, say, five times equity. It can only do this by selling assets. Asset 
prices decrease as a result of what this hedge fund, and other hedge funds, are doing. The 
hedge fund’s equity declines and further sales are necessary. 


24.3.3 Irrational Exuberance 


The term “irrational exuberance” was used by Alan Greenspan, Federal Reserve Board 
chairman, in a speech in December 1996 when, in reference to the stock market, he 
asked, “How do we know when irrational exuberance has unduly escalated asset val- 
ues?” (The phrase has been remembered because the speech was followed by declines in 
stock prices worldwide.) Most liquidity black holes can be traced to irrational exuberance 
of one sort or another. What happens is that traders working for many different financial 
institutions become irrationally exuberant about a particular asset class or a particular 
market variable. The balance sheets of financial institutions then become overextended 
through the accumulation of exposure to this asset class or market variable. Often the 
process is self-reinforcing. When many financial institutions choose to take a particular 
position, prices increase, making the position look profitable. This creates extra desire 
on the part of financial institutions to take the position and yet more profits. Risk man- 
agers working for the financial institution should (and probably will) complain about 
the risks being taken, but in many instances senior management are likely to ignore 
their concerns because high profits are being made. To quote Chuck Prince, ex-CEO of 
Citigroup, on July 10, 2007: “When the music stops, in terms of liquidity, things will be 
complicated. But as long as the music is playing, you’ve got to get up and dance. We’re still 
dancing.” 

At some stage the bubble must burst. Many traders then try to get out of their 
positions at the same time, causing illiquid markets and huge losses. Volatility increases and 
the risk management procedures used within the financial institution (e.g., the calculation 
of market VaR from historical data) can cause many financial institutions to try to unwind 
a wide range of risky positions at the same time. This can lead to further losses and more 
serious illiquidity problems. There may be failures (or rumors of failures) by some banks. 
Most banks are likely to experience liquidity funding problems and as a result lending 
may be curtailed. 
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The classic example of what has been described above is the subprime crisis that 
started in 2007. Other examples are the 1987 stock market crash, the 1994 bond market 
crash, the 1997—1998 Asian monetary crisis, and the 1998 Long-Term Capital Man- 
agement failure. Irrational exuberance is part of human nature and to some extent is 
inevitable. As discussed in Chapter 6, it is exacerbated by the way traders are paid. A large 
part of the compensation comes from a bonus at year-end, which depends on perfor- 
mance during the year. A trader may be savvy enough to know that a market is irrationally 
exuberant and that there will be a correction. However, if there is a good chance that the 
correction will be delayed until next year, the trader is motivated to continue building 
up his or her position to maximize short-term compensation. 


24.3.4 The Impact of Regulation 


In many ways it is a laudable goal on the part of regulators to seek to ensure that banks 
throughout the world are regulated in the same way. As explained in Chapter 15, capital 
requirements and the extent to which they were enforced varied from country to country 
prior to Basel I. Banks were competing globally and as a result a bank subject to low 
capital requirements, or capital requirements that were not strictly enforced, found it 
easier to take risks and was therefore able to be more competitive in the pricing of some 
products. 

However, a uniform regulatory environment comes with costs. All banks tend to 
respond in the same way to external events. Consider for example market risk. When 
volatilities and correlations increase, market risk VaR and the capital required for market 
risks increase. Banks then take steps to reduce their exposures. Since banks often have 
similar positions to each other, they try to do similar trades. A liquidity black hole can 
develop. 

There is a similar issue as far as credit risk is concerned. During the low point of the 
economic cycle, default probabilities are relatively high and capital requirements for loans 
under the Basel II internal-ratings-based models tend to be high. As a result, banks may 
be less willing to make loans, creating problems for small and medium-sized businesses. 
During the high point of the business cycle, default probabilities are relatively low and 
banks may be too willing to grant credit. (This is similar to the phenomenon described in 
Figures 24.2 and 24.3.) The Basel Committee has recognized this problem and has dealt 
with it by asserting that the probability of default should be an average of the probability 
of default through the economic or credit cycle, rather than an estimate applicable to 
one particular point in time. 

Should other financial institutions such as life insurance companies and pension funds 
be regulated in the same way as banks? It is tempting to answer “yes” so that one financial 
institution is not given an advantage over others. But the answer should be “no.” These 
financial institutions have longer time horizons than banks. They should not be penalized 
for investing in illiquid assets. Also, they should not be required to adjust their portfolios 
when volatilities and correlations increase. These parameters tend to be mean reverting 
and so they eventually decrease again. 
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24.3.5 The Importance of Diversity 


Models in economics usually assume that market participants act independently of each 
other. We have argued that this is often not the case. It is this lack of independence that 
causes liquidity black holes. Traders working for financial institutions tend to want to 
do the same trades at the same time. To solve the problem of liquidity black holes, we 
need more diversity in financial markets. One way of creating diversity is to recognize 
that different types of financial institutions have different types of risks and should be 
regulated differently. 

Hedge funds have become important market participants. They are much less reg- 
ulated than banks or insurance companies and can follow any trading strategy they like. 
To some extent they do add diversity (and therefore liquidity) to the market. But, as 
mentioned earlier, hedge funds tend to be highly leveraged. When liquidity tightens as 
it did in the second half of 2007, all hedge funds have to unwind positions accentuating 
the liquidity problems. 

One conclusion from the arguments we have put forward is that a contrarian invest- 
ment strategy has some merit. If markets overreact for the reasons we have mentioned, 
an investor can do quite well by buying when everyone else is selling and there is very 
little liquidity. However, it can be quite difficult for a financial institution to follow such 
a strategy if it is subject to short-term VaR-based risk management. 


Summary 


There are two types of liquidity risk: liquidity trading risk and liquidity funding risk. 
Liquidity trading risk is concerned with the ease with which positions in the trading book 
can be unwound. The liquidity trading risk of an asset depends on the nature of the asset, 
how much of the asset is to be traded, how quickly it is to be traded, and the economic 
environment. The credit crisis of 2007 emphasizes the importance of transparency. Assets 
that are not well defined or well understood are unlikely to trade in a liquid market for 
long. The liquidity of an asset at a particular time can be measured as the dollar bid— 
offer spread or as the proportional bid—offer spread. The latter is the difference between 
the bid and offer price divided by the average of the bid and offer price. The cost of 
unwinding a position in the asset is half of the bid—offer spread. Financial institutions 
should monitor the cost of unwinding the whole trading book in both normal market 
conditions and stressed market conditions. 

A trader, when faced with the problem of unwinding a large position in an asset, has 
a trade-off between the bid—offer spread and market risk. Unwinding quickly leads to 
high bid—offer spreads, but low market risk. Unwinding slowly leads to lower bid—ofter 
spreads, but more market risk. The optimal trading strategy depends on (a) the dollar 
bid—offer spread as a function of the quantity traded in a day and (b) the probability 
distribution for daily changes in the asset price. For any particular unwind strategy, the 
trader can choose a confidence level and calculate the unwind cost that will not be 
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exceeded with the confidence level. The unwind strategy that minimizes this cost can 
then be determined. 

Liquidity funding risk management is concerned with being able to meet cash needs 
as they arise. It is important for a financial institution to forecast its cash needs in both 
normal market conditions and stressed market conditions to ensure that they can be met 
with almost total certainty. Cash needs depend on depositor withdrawals, drawdowns on 
lines of credit, guarantees that have been made, defaults by counterparties, and so on. 
Sources of cash are instruments that can be readily converted into cash, borrowings in 
the wholesale market, asset securitizations, new depositors, cash itself, and (as a last resort) 
borrowings from a central bank. In June 2008, bank regulators issued a list of 17 principles 
describing how banks should manage their liquidity and indicated that they would be 
monitoring the liquidity management procedures of banks more carefully in the future. 

The most serious liquidity risks arise from what are sometimes termed liquidity 
black holes. These occur when all traders want to be on the same side of the market 
at the same time. This may be because they have similar positions and manage risks in 
similar ways. It may also be because they become irrationally exuberant, overexposing 
themselves to particular risks. What is needed is more diversity in the trading strategies 
followed by market participants. Traders who have long-term objectives should avoid 
allowing themselves to be influenced by the short-term overreaction of markets. 
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Practice Questions and Problems 
(Answers at End of Book) 


24.1 What was the transparency problem in the subprime crisis of 2007? 
24.2 An asset is quoted bid 50, offer 55. What does this mean? What is the proportional 
bid—offer spread? 
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24.3 


24.4 


24.5 


24.6 


24.7 
24.8 


24.9 


24.10 
24.11 


24.12 


24.13 
24.14 


24.15 


Suppose that an investor has shorted shares worth $5,000 of Company A and 
bought shares worth $3,000 of Company B. The proportional bid—offer spread 
for Company A is 0.01 and the proportional bid—offer spread for Company B is 
0.02. What does it cost the investor to unwind the portfolio? 

Suppose that in Problem 24.3 the bid—offer spreads for the two companies are 
normally distributed. For Company A the bid—offer spread has a mean of 0.01 
and a standard deviation of 0.01. For Company B the bid—offer spread has a mean 
of 0.02 and a standard deviation of 0.03. What is the cost of unwinding that the 
investor is 95% confident will not be exceeded? 

A trader wishes to unwind a position of 60 million units in an asset over 10 days. 
The dollar bid—offer spread as a function of daily trading volume, q, is a + be“ 
where a = 0.2, b = 0.1, and c = 0.08 and q is measured in millions. The standard 
deviation of the price change per day is $0.1. What is the optimal strategy that 
minimizes the 95% confidence level for the costs? 

Explain the difference between the liquidity coverage ratio (LCR) and the net 
stable funding ratio (NSFR). 

Why is it risky to rely on wholesale deposits for funding? 

What was the nature of the funding risk problems of Ashanti Goldfields and Met- 
allgesellschaft? 

What is meant by (a) positive feedback trading and (b) negative feedback trading? 
Which is liable to lead to liquidity problems? 

What is meant by liquidity-adjusted VaR? 

Explain how liquidity black holes occur. How can regulation lead to liquidity 
black holes? 

Why is it beneficial to the liquidity of markets for traders to follow diverse trading 
strategies? 


Further Questions 


Discuss whether hedge funds are good or bad for the liquidity of markets. 
Suppose that a trader has bought some illiquid shares. In particular, the trader has 
100 shares of A, which is bid $50 and offer $60, and 200 shares of B, which is 
bid $25 and offer $35. What are the proportional bid—offer spreads? What is the 
impact of the high bid—offer spreads on the amount it would cost the trader to 
unwind the portfolio? If the bid—offer spreads are normally distributed with mean 
$10 and standard deviation $3, what is the 99% worst-case cost of unwinding in 
the future as a percentage of the value of the portfolio? 

A trader wishes to unwind a position of 200,000 units in an asset over eight 
days. The dollar bid—offer spread, as a function of daily trading volume gq, is a+ 
be where a= 0.2, b = 0.15, and c= 0.1 and q is measured in thousands. The 
standard deviation of the price change per day is $1.50. What is the optimal trading 
strategy for minimizing the 99% confidence level for the costs? What is the average 
time the trader waits before selling? How does this average time change as the 
confidence level changes? 


Chapter 25 


Model Risk Management 


he use of models by financial institutions is growing fast. Models are employed 

for making credit decisions, liquidity management, the evaluation of credit 

exposures, the valuation of derivatives, the calculation of risk measures such as 
value at risk (VaR) and expected shortfall (ES), the management of portfolios of financial 
assets for clients, the assessment of capital adequacy, customer relationship management, 
fraud detection, the identification of money laundering, and so on. Complex models, 
with advanced analytic techniques such as machine learning, are being used to automate 
some activities previously carried out by humans. (See Chapter 28 for a discussion of 
machine learning.) It is now recognized that large financial institutions need a model 
risk management function to ensure that models are appropriate for their intended pur- 
pose and are used in the correct way. 

Models are approximations to reality. The art of building a model is to capture the 
key aspects of reality for a particular application without allowing the model to become 
so complicated that it is almost impossible to use. Nearly always, a model relies on some 
assumptions about the phenomenon being modeled. It is important to understand those 
assumptions and know when they are no longer appropriate. 

It used to be the case that models could be developed by a quant on a spreadsheet and 
used with minimal oversight. This is no longer the case. Models must now be well doc- 
umented and reviewed periodically to determine whether they are still appropriate and 
are working as intended. When changes are made to models, approval is usually necessary. 
These aspects of model risk management are typically not popular with model builders, 
who tend to enjoy the creative aspects of model development and dislike spending time 
justifying their models and documenting them. 
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Earlier in this book, we have made the point that regulators have, since the 2007— 
2008 crisis, become less enamoured with the use of internal models to determine capital 
requirements. They have also become more concerned about the way models are used 
for other purposes within financial institutions. Model risk is a component of operational 
risk. Regulators want banks to have a system for managing the risks presented by models. 
Models must be developed, validated, and used in accordance with this system. 

This chapter starts by reviewing regulatory requirements on model risk management. 
It describes the role of the model validation group and then moves on to compare the way 
in which models are used in finance with the way they are used by physicists and other 
scientists. After that, it focuses on the use of models for valuation and risk management 
and examines some of the lessons that can be learned from the mistakes made in that area. 


25.1 Regulatory Requirements 


In April 2011, the U.S. Board of Governors of the Federal Reserve System published 
SR 11-7, which provides guidance to banks on effective model risk management.! This 
document has proved to be very influential, and U.S. banks are expected to ensure that 
their practices are consistent with the principles outlined in the document. Bank super- 
visors elsewhere have indicated that they expect to see a similar systematic approach to 
understanding and managing model risk. If a bank supervisor feels that a bank’s model 
risk management systems are inadequate, it may request the bank to hold additional 
capital. 

SR 11-7 defines model risk as “the potential for adverse consequences from decisions 
based on incorrect or misused model outputs and reports.” Model risk occurs primarily 
for two reasons: 


1. The model may have fundamental errors when viewed from the perspective of its 
objectives. These errors may concern the data it uses, the way calculations are done, 
numerical procedures, assumptions made, and so on. 

2. The model may be used incorrectly or inappropriately. It is important to understand 
the limitations of models. A model may have been developed assuming a particular set 
of market conditions or a particular type of customer behavior. When these change, 
the model may no longer be appropriate. 


Financial institutions should attempt to identify the sources of model risk and their 
magnitude. Complex models generally lead to more model risk than simple models. 
When models are linked or when several models are based on the same underlying 
set of assumptions, there is liable to be a greater degree of model risk. It is important 
to monitor model performance to determine as early as possible when a model is, for 
whatever reason, not performing as well as expected. 


'See Board of Governors of the Federal Reserve System, Office of the Comptroller of the Currency, 
“Supervisory Guidance on Model Risk Management,’ SR 11-7, April 2011. 
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25.1.1 Model Development 


When a model is developed, documentation is important. There are several reasons for 
this. If the model developer leaves the organization and there has been no good docu- 
mentation, it might be very difficult for others to assume responsibility for the model, 
and, as a result, the work of the developer might be lost. Regulators frequently ask to see 
model documentation when they evaluate the procedures used by a financial institution. 
Also, model validation and periodic review are facilitated if the model has been well 
documented. 

An important aspect of model documentation is a statement of the purpose of the 
model, the underlying theory, an evaluation of industry practice, and a review of pub- 
lished research. It is important that the documentation clearly explains the underlying 
mathematics, numerical procedures used, assumptions made, and so on. The documenta- 
tion should be sufficiently detailed that someone who is new to the area can understand 
what has been done. As indicated earlier, model documentation will often be viewed as 
a boring chore by the model developer. But in practice it often proves to be a useful 
discipline, forcing the model developer to step back and think about his or her work in 
a way not done before. Sometimes changes to the model result. 

Models should be tested as thoroughly as possible. This should include careful testing 
of all parts of the computer code and evaluating the model’s behavior for a range of input 
data. The testing should identify situations where the model performs well and situations 
where it performs poorly. This can be done by observing the performance of the model 
with progressively more extreme values for the inputs. In some situations, it is appropriate 
to compare the output from the proposed model with the output from a more elaborate 
model that would be computationally too slow to be used in practice. 

The way in which the performance of a model is tested will depend on the nature 
of the model. Models developed from historical data should be tested out of sample. For 
example, if a model for making loan decisions is developed from a particular set of data 
collected over a particular period, it would clearly not be appropriate to test it using those 
data. Similarly, if a portfolio management strategy for a client is developed by analyzing 
the behavior of asset prices during a particular period, it should be tested using data from 
another period. 

We have discussed in earlier chapters how back-testing can be used for VaR models. 
It can be used in other situations as well. Suppose a model based on a particular set of 
assumptions is used to price barrier options. It should be possible to determine how well 
it would have worked in the past. Would a user of the model have made a profit on 
average? If so, would the profit have been sufficiently great to compensate for the risks 
being taken? 

As a model is used in practice, a financial institution should collect information on 
whether it is reflecting economic and business realities and working in the way intended. 
There is a danger that the existence of a model may lead to a change in behavior. For 
example, a model for managing customer relationships may lead to customers behaving 
differently; a model for detecting money laundering might work well until would-be 
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money launderers learn (perhaps by trial and error) how the model works and adjust 
their behavior accordingly. Even model users may find ways of adjusting inputs to a 
model or the way the model is used to make themselves look good. 


25.1.2 Model Validation 


One result of the increasing attention being devoted to model risk management is the 
formation of model validation groups. These groups are responsible for verifying that all 
the models used by the organization are performing as expected. They provide a check 
on the work of the model developer and critically review what has been produced and 
how the model is performing. 

The person charged with validating a model should be independent of the model 
developer and the model user. Ensuring independence is critical to the success of the 
model risk management function. The reporting lines and incentives should be cho- 
sen to promote independence. The success of the model validation function should be 
judged by the objectivity of the model reviews, the issues raised, and the actions taken 
by management to address the issues. Corporate culture is important. If objective think- 
ing and the challenging of decisions are not encouraged, the model validation process is 
likely to be ineffective. Regulators are likely to ask to see instances where models and 
their use have been appropriately changed as a result of validation. 

Models are sometimes used informally in a small way prior to initial validation. How- 
ever, before a model becomes part of a financial institution’s way of operating it is impor- 
tant that a first validation be carried out. The extent and rigor of the validation should 
depend on the risks presented by the model. Consider the situation where a model 
has been developed to automate billions of dollars of loan decisions. Here it is clearly 
important that the validation be thorough. Even if the model is performing well and 
in accordance with its objectives, the model validation group might insist on a staged 
implementation with close monitoring to reduce model risk. 

Model validation should continue on an ongoing basis after the model has been 
approved for use. This ensures that the impact of changes in markets and business practices 
on the performance of the model is monitored. SR 11-7 recommends that models be 
reviewed by the validation group at least annually. Users of the model (particularly if 
they are different from the developers) can provide input, and the validation group can 
determine whether the performance of the model using recent data is as good as it was 
on the data available when the model was first produced. The outcomes from using the 
model (e.g., credit decisions made, return on investments from using the model, fraud 
attempts thwarted) should be documented and compared with what was expected. 

Business Snapshot 25.1 describes the London Whale story, which is an example of a 
situation where the model validation group was under pressure to approve a model. This 
was because the model provided justification for reducing VaR, and therefore regulatory 
capital, when a huge position in one product was hedged with an opposite position in a 
similar product. If the model validation group had more time, it would have carried out 
a more careful review of the model. It might have argued that the correlation between 
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BUSINESS SNAPSHOT 25.1 
The London Whale 


The London Whale was a trader, Bruno Iksil, who worked for the Chief Invest- 
ment Office (CIO) at JPMorgan Chase (JPM). His nickname arose from his huge 
positions in credit default swaps on CDX and iTraxx indices. (As explained in 
Section 19.4, these are products that provide protection against defaults for a 
portfolio of companies.) 

The CIO’s responsibility was to invest the bank’s surplus cash. At the end of 
2011, the bank’s positions in the credit indices were moderately large and it was 
net long (1.e., 1t was buying more protection against defaults than it was selling). 
This net long position hedged the CIO’s positions in fixed-income products. 
JPM wanted to reduce its risk-weighted assets to comply with the new Basel 
regulations. It also wanted to move toward a more neutral position in the credit 
indices because it felt that the economy was improving. Unwinding its existing 
positions in the indices would lead to a loss, partly because the positions had a 
negative mark-to-market value and partly because the positions were sufficiently 
large that unwinding them would tend to move the market against JPM. As a 
result, the CIO chose to hedge its net long position in credit indices by selling 
protection on different indices. 

The CIO expected the long and short positions to offset each other. In fact, 
this did not happen. Market participants realized that someone was taking big 
positions. (JPM, like other dealers, tries to keep its positions secret. However, 
market participants realized that there was a “whale” in the market and it even- 
tually became apparent that the whale was a trader for JPM.) Realizing that the 
positions would have to be unwound at some future time and that this would 
move the market, some market participants engaged in predatory trading, taking 
Opposite positions to JPM. This moved the market against JPM and meant that 
the values of JPM’s long and short positions were not as closely correlated as 
expected. 

The CIO increased the size of its positions. This was partly for technical 
hedging reasons and partly to defend against predators. Its huge position was 
theoretically well balanced (i.e., models showed that its value should not be 
affected to any great extent by credit spread movements). It had long and short 
positions in CDX investment-grade (IG) indices and in the CDX high-yield 
(HY, non-investment-grade) indices. It sold large amounts of credit protection 
on the 10-year CDX IG 9 index and bought large amounts of protection on 
the five-year CDX IG 9 index. (This was an index created from 125 names in 
September 2007. Although not the current CDX IG index, it still traded rea- 
sonably actively; 121 of the original 125 names had not defaulted.) Prices did 
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not move in the way that was expected, and a loss of about $6 billion was rec- 
ognized by April/May 2012. Deferred bonuses to CIO employees were clawed 
back, and Jamie Dimon, JPM’s chairman and CEO, took a pay cut for the year. 
Allegations that the value of the portfolio was misreported to shareholders were 
investigated by U.S. government agencies, and JPM ended up paying a fine of 
almost $1 billion. 

Why was the CIO allowed to take such huge risks? In January 2012, the CIO 
exceeded its VaR limit and the bank as whole exceeded its VaR limit as a result 
of the CIO’s trading. However, a new VaR model was being developed. This 
model, which was finished in the second half of January and quickly reviewed and 
approved by the model validation group, reduced the VaR of the CIO’s portfolio 
by 50% so that pre-January VaR limits were adhered to. JPM’s January 2013 
internal report on the London Whale losses (particularly the appendix) indicates 
that a number of mistakes were made. The quant who developed the model 
and the model validation group were under pressure to develop and approve 
the model. The quant reported to traders in the CIO, and the model validation 
group had time to carry out only limited back-testing of the model. In May 
2012, the model was re-evaluated. A number of serious errors were discovered 
and its use was discontinued. 

This story emphasizes the importance of the model validation function and 
why it needs to be independent. 


the two products was assumed to be too large. It could also have made the point that 
hedges that work well for small positions do not always work well for large positions. 
The bank’s use of the model for its large position had the potential to lead to predatory 
trading, rendering the historical comparisons of the prices of the products being traded 
less relevant. 

SR 11-7 argues that comprehensive validation should have three core elements: 


1. Evaluation of conceptual soundness, including developmental evidence 
2. Ongoing monitoring, including process verification and benchmarking 
3. Outcomes analysis, including back-testing 


A sound developmental process should produce documented evidence in support of 
all model assumptions. There is a danger that the model developer has a bias toward a 
particular type of model—or even an interest in showing off his or her quant skills by 
developing an unnecessarily complex model. The validation work should ensure that this 
has not happened. Stability is an important property of models. If a small change in an 
input produces a large change in the output, the model may be unstable and inappropriate 
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for its intended use. Model validation should include model stress testing where the model 
is subjected to extreme inputs to test its range of applicability. 

As already mentioned, ongoing monitoring is a core part of model validation. This 
includes ensuring that all changes made to a model are tested and documented. If the 
model user and the model developer are the same person (as is sometimes the case), there 
is a temptation for the model user to tweak the code to improve model performance. 
This is unsatisfactory and can lead to a situation where the model is nothing more than 
a collection of tweaks with no unifying theory. Ifa model is not performing as well as it 
could, a review should be undertaken and possibly a new model with better foundations 
developed. It is important for a model validation function to investigate situations where 
a model is not used because managers feel the output is not consistent with their expe- 
rience and judgment. This can be an indication that improvements or a new approach is 
necessary. 

Outcomes analysis, such as back-testing, is important for monitoring a model’s per- 
formance. Some work in this area should be carried out by the model developer and 
documented. This work should be checked by the model validation group and contin- 
ued on an ongoing basis. The reasons for any differences between expected outcomes 
and actual outcomes should be analyzed. For example, a particular liquidity management 
model might be found to work well in low-interest-rate environments, but less so when 
interest rates increase. This type of analysis can lead to model improvements. 


25.1.3 Vendor Models 


Not all models are developed internally. Some are purchased from model vendors. Some- 
times the purpose of a model that has been purchased is to benchmark a similar model 
developed internally. In this case, the vendor model can be a useful validation tool. In 
other situations, the vendor model may be used because it is less expensive than devel- 
oping a similar model internally. When this is the case, vendor models should be subject 
to the same validation as internal models. However, they present particular challenges for 
the model validation group, as the modeling expertise is external to the organization and 
some aspects of the model may be proprietary. 

Financial institutions should have processes in place for selecting vendor models. 
The vendor should be required to provide documentation explaining the model design, 
testing results, and model limitations. Financial institutions should insist that vendors 
conduct ongoing performance monitoring and outcomes analysis, and that they provide 
the results of this to their clients. 

The validation group should be responsible for approving the use of a vendor model. 
It may not have access to the computer code, but it can conduct extensive analysis on 
the results produced by the model for a variety of inputs. It should insist that the vendor 
provide full information about the data used to develop the model. There should be a 
contingency plan for a situation in which the vendor goes out of business or decides to 
stop supporting the model. One approach here is to insist that the computer code and 
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all other proprietary material be placed in escrow and made available to the client in 
circumstances in which the vendor is no longer able to provide support. 


25.2 Models in Physics and Finance 


Many individuals who were well trained in physics now work in finance developing 
models. Some models in finance are structurally similar to those in physics. For exam- 
ple, the differential equation that leads to the famous Black—Scholes—Merton option 
pricing model is the heat-exchange equation that has been used by physicists for many 
years. However, as Derman has pointed out, there is an important difference between the 
models of physics and the models of finance? The models of physics describe physical 
processes and are highly accurate. By contrast, the models of finance describe the behav- 
ior of market variables, customers, borrowers, and so on. The phenomena being modeled 
are a result of the actions of human beings. As a result, the models are at best approximate 
descriptions of the phenomena being modeled. 

One important difference between the models of physics and the models of finance 
concerns model parameters. The parameters of models in physics generally do not 
change. For example, the gravitational pull on the surface of the earth is always 32 feet per 
second per second. By contrast, parameters in finance models change daily. The volatility 
used to price an option might be 20% one day, 22% the next day, and 19% the following 
day. Indeed, as we will see later, the parameters used on a day are chosen to be consistent 
with market prices, not to determine them. 


25.3 Simple Models: Expensive Mistakes 


Models do not have to be highly complex to lead to losses. Business Snapshot 25.2 
provides a simple example of this. Kidder Peabody’s computer system did not account 
correctly for funding costs when a linear product was traded. As a result, the system 
indicated that one of the company’s traders was making a large profit when in fact he 
was taking a huge loss. The model used by Kidder Peabody should have been subject to 
independent validation. Even if the theoretical problem with the model was not picked 
up, the difference between model prices and market prices would have indicated to the 
model validation group that something was seriously wrong. 

The Kidder Peabody modeling mistake would have been easy to identify if there 
was a culture within the organization that allowed traders to be challenged. Indeed, Barry 
Finer, risk manager for the government bond desk, did point out the difficulty of making 
large arbitrage profits from a market as efficient as the U.S. government bond market. But 
his concerns were dismissed out of hand. 


?See E. Derman, My Life as a Quant: Reflections on Physics and Finance (Hoboken, NJ: John Wiley & 
Sons, 2004), and E. Derman, “Model Risk,” Risk 9, no. 2 (May 1996): 139-145. 
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BUSINESS SNAPSHOT 25.2 
Kidder Peabody’s Embarrassing Mistake 


Investment banks have developed a way of creating a zero-coupon bond, called 
a strip, from a coupon-bearing Treasury bond by selling each of the cash flows 
underlying the coupon-bearing bond as a separate security. Joseph Jett, a trader 
working for Kidder Peabody, had a relatively simple trading strategy. He would 
buy strips and sell them in the forward market. The forward price of the strip 
was always greater than the spot price, so it appeared that he had found a money- 
making machine! In fact, the difference between the forward price and the spot 
price represents nothing more than the cost of funding the purchase of the strip. 
Suppose, for example, that the three-month interest rate is 4% per annum and 
the spot price of a strip is $70. The three-month forward price of the strip is 
70¢°-°4*3/12, = $70.70. This is because, for someone who wants to own the strip 
in three months, the forward contract saves $0.70 of funding costs. 

Kidder Peabody’s computer system reported a profit on each of Jett’s trades 
equal to the excess of the forward price over the spot price ($0.70 in our 
example). By rolling his contracts forward, Jett was able to prevent the funding 
cost from accruing to him. The result was that the system reported a profit of 
$100 million on Jett’s trading (and Jett received a big bonus) when in fact there 
was a loss in the region of $350 million. This shows that even large financial 
institutions can get relatively simple things wrong! 


Sometimes the assumptions made by model developers are inaccurate but less easy 
to identify than in the Kidder Peabody case. An example here is provided by the interest 
rate swap market. A plain vanilla interest rate swap, such as the one described in Sec- 
tion 5.5.3, can be valued by assuming that forward interest rates will be realized. This is 
explained in Appendix D. For example, if the forward interest rate for the period between 
2 and 2.5 years is 4.3% per annum, the swap can be correctly valued on the assumption 
that the floating rate that is exchanged for a fixed rate at the 2.5-year point is calculated 
using this rate. It is tempting to generalize from this and argue that any swap agree- 
ment to exchange cash flows in the future can be valued on the assumption that forward 
rates are realized. This is not so. Consider, for example, what is known as a LIBOR-in- 
arrears swap. In this instrument, the floating rate that is observed on a particular date is 
paid on that date, not one accrual period later as is the case for a plain vanilla swap. A 
LIBOR-in-arrears swap should be valued on the assumption that the realized interest 
rate equals the forward interest rate plus a “convexity adjustment.’ As indicated in Busi- 
ness Snapshot 25.3, financial institutions that did not understand this lost money in the 
mid-1990s. 
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BUSINESS SNAPSHOT 25.3 
Exploiting the Weaknesses of a Competitor’s Model 


A LIBOR~in-arrears swap is an interest rate swap where the floating interest 
rate is paid on the day it is observed, not one accrual period later. Whereas a 
plain vanilla swap is correctly valued by assuming that future rates will be today’s 
forward rates, a LIBOR-in-arrears swap should be valued on the assumption that 
the future rate is today’s forward interest rate plus a “convexity adjustment.” 

In the mid-1990s, sophisticated financial institutions understood the cor- 
rect approach for valuing a LIBOR-in-arrears swap. Less sophisticated financial 
institutions used the naive “assume forward rates will be realized” approach. The 
result was that by choosing trades judiciously, sophisticated financial institutions 
were able to make substantial profits at the expense of their less sophisticated 
counterparties. 

The derivatives business is one where traders do not hesitate to exploit the 
weaknesses of their competitors’ models! 


A similar example to this is provided by the interest rate futures markets. In most 
situations, it is reasonable to assume that a futures price is the same as the corresponding 
forward price. For example, the futures price in a futures contract to deliver gold in one 
year should be very close to the forward price in a forward contract to deliver gold at the 
same time. There are some differences between the two contracts. For example, a futures 
contract is settled daily and the party with the short position has some flexibility on the 
precise delivery date. But these differences do not create a material difference between the 
futures and forward prices. In the mid-1990s, many market participants chose to assume 
that the futures rate in an interest rate futures contract is the same as the corresponding 
forward rate. In fact, the approximate result that is true for most other futures contracts 
does not apply to interest rate futures. This is because interest rate futures last a long time 
so that the impact of daily settlement cannot be ignored. Also, in the case of interest rates, 
futures and forwards are settled at different times. Futures are settled at the beginning of 
the period covered by the underlying rate whereas forwards are settled at the end? 


25.3.1 Monitoring Trading Patterns 


The situations we have just outlined suggest that an important way of identifying 
model risk in the valuation of products is to monitor trading patterns. In particular, 


3See J. Hull, Options, Futures, and Other Derivatives, 10th edition (Upper Saddle River, NJ: Pearson, 
2017), for a more detailed discussion. 
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the risk management function within a financial institution should keep track of the 
following: 


1. The type of trading the financial institution is doing with other financial institutions 
2. How competitive it is in bidding for different types of structured transactions 
3. The profits being recorded from the trading of different products 


Getting too much of a certain type of business, or making huge profits from relatively 
simple trading strategies, can be a warning sign. Ifa financial institution finds that its prices 
are out of line with the market, it must make adjustments to its valuation procedures to 
bring them into line. 

The high profits being recorded for Joseph Jett’s trading at Kidder Peabody (see 
Business Snapshot 25.2) should have indicated that something was amiss. Likewise, if in 
the mid-1990s a financial institution’s risk management team discovered that traders were 
entering into a large number of LIBOR-in-arrears swaps with other financial institutions 
(see Business Snapshot 25.3) where they were receiving fixed and paying floating, they 
could have alerted modelers to a potential problem and directed that trading in the 
product be temporarily stopped. 


25.4 Models for Pricing Actively Traded Products 


When a financial instrument trades actively, we do not need a model to know what its 
price is. The market tells us this. Suppose, for example, that a certain option on a stock 
index trades actively and is quoted by market makers as bid $30, offer $30.5. Our best 
estimate of its current value is the mid-market price of $30.25. 

A situation that is common in the over-the-counter market is one where a financial 
instrument that has to be valued is a standard product such as an option, but not exactly 
the same as one that trades in the market. For example, it might be an option with a 
strike price or time to maturity (or both) different from the options whose prices can 
be observed. The model is then used as a tool to ensure that the way an instrument is 
priced is consistent with the observed market prices of other similar instruments. A good 
example of how this is done is provided by the way the Black—Scholes—Merton model is 
used in practice. (See Appendix E for a description of the Black—Scholes—Merton model.) 

The Black—Scholes-Merton model has been successful because all its inputs except 
one are known with a high degree of certainty. The one input that cannot be observed 
is volatility. This means that there is a one-to-one correspondence between prices and 
volatilities. When the volatility for an option has been specified, the price can be calcu- 
lated. Similarly, when the price is known, a volatility can be determined. The volatility 
determined from a market price is known as an implied volatility. Traders responsible for 
options on a particular asset keep track of what is known as the volatility surface. This is 
the set of implied volatilities calculated for options with different strike prices and times 
to maturity. 
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Table 25.1 Volatility Surface 
Volatilities for different strike prices and maturities are shown as % per annum. 


Strike Price 


0.90 0.95 1.00 1.05 1.10 
1-month 14.2 13.0 12.0 13.1 14.5 
3-month 14.0 13.0 12.0 13.1 14.2 
6-month 14.1 13.3 12.5 13.4 14.3 
1-year 14.7 14.0 13.5 14.0 14.8 
2-year 15.0 14.4 14.0 14.5 15.1 
5-year 14.8 14.6 14.4 14.7 15.0 


Table 25.1 provides an example of the volatility surface that could be constructed for 
options on an exchange rate. (For simplicity we assume that the current exchange rate is 
close to 1.00.) Note that, if the Black—Scholes—Merton model were a perfect description 
of reality, the volatilities in the table would be constant and not vary through time. In 
practice, implied volatilities for options are not the same for different strike prices and 
times to maturity, and they do vary through time. (This is true for options on all assets.) 
It is the trader’s job to understand the current volatility surface and how it could change 
through time. 

Suppose that a trader wishes to value a nine-month option with a strike price of 1.05 
on the day Table 25.1 is produced. A six-month option with this strike price has a volatil- 
ity of 13.4% and a one-year option with this strike price has a volatility of 14.0%. Inter- 
polating between these, a trader would use a volatility of 13.7% in the Black—Scholes— 
Merton model to value the option. Suppose next that both the time to maturity and the 
strike price are different from those in the table. A two-dimensional interpolation is then 
necessary. This involves first interpolating between strike prices and then between times 
to maturity. (The same answer is obtained if one first interpolates between maturities 
and then between strike prices. See Problem 25.12.) Consider a 1.5-year option with a 
strike price of 0.92. A one-year option with this strike price has an interpolated implied 
volatility of 14.42%; a two-year option with the strike price has an interpolated volatility 
of 14.76%. The interpolated volatility for a 1.5-year option is therefore 14.59%. 


25.4.1 Sources of Model Risk for Actively Traded Products 


One issue for the model validation group is whether the prices determined for actively 
traded instruments depend on the model used. If, in the example just considered, we 
replaced the Black—Scholes—Merton model by another model, would the prices be sim- 
ilar? The answer to this question is that, when the nature of the product being valued is 
exactly the same as actively traded instruments (i.e., only parameters like the strike price 
and time to maturity are different), the model does not usually affect prices materially 


+ This is demonstrated by S. Figlewski, “Assessing the Incremental Value of Option Pricing Theory 
Relative to an Informationally Passive Benchmark,” Journal of Derivatives (Fall 2002): 80-96. 
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This is because the model is being used as nothing more than a sophisticated interpola- 
tion tool and two different interpolation tools usually give very similar valuations. 

There is liable to be model risk in extreme situations. The model validation group 
should check this out. For example, it should examine the operation of different models 
when the volatility surface is very steep in either the strike price or time-to-maturity 
dimension. Also, the model validation group should check on the reliability of the pricing 
data that are available in the market and how the model would work when less data than 
usual are available. 


25.4.2 Hedging 


In the case of actively traded instruments, the main model risk concerns hedging. As we 
have discussed, switching from one model to another does not usually affect prices to 
any great extent, but it can have an effect on hedging performance. 

We can distinguish between within-model hedging and outside-model hedging. Within- 
model hedging is designed to deal with the risk of changes in variables that are assumed 
to be uncertain by the model. Outside-model hedging deals with the risk of changes in 
variables that are assumed to be constant (or deterministic) by the model. For the Black- 
Scholes—Merton model, hedging against movements in the underlying stock price (delta 
and gamma hedging) is within-model hedging, because the model assumes that stock 
price changes are uncertain. However, hedging against volatility changes (vega hedging) 
is outside-model hedging because the model assumes that volatility is constant. 

In practice, traders almost invariably do outside-model hedging as well as within- 
model hedging. This is because, as we have explained, the calibration process results in 
parameters such as volatilities (which are assumed by a model to be constant) changing 
daily. A good options trader will monitor the exposure of a trading book to the sorts of 
shifts in the volatility surface that are typically seen? 

If a model is designed to be always consistent with market prices and hedging is 
carried out continuously for all the variables that can change (both those that are assumed 
to be constant by the model and those that are assumed to be stochastic), the value of 
the hedger’s position should in theory not change. However, in practice hedging cannot 
be carried out in this idealized way. Consider again the Black—Scholes—Merton model 
to value a position in options on an exchange rate. Each option price is a deterministic 
function of the exchange rate, its implied volatility, and time This means that the trader 
has an exposure to the exchange rate and the implied volatilities of all the options in the 
portfolio. The exposure to the exchange rate can be hedged without difficulty by trading 
the foreign currency. This is known as delta hedging and is usually done at least once a 
day (see Chapter 8). Hedging the risk of volatility changes is not as easy. It requires trades 


5A principal components analysis, as described in Chapter 9, can be used. If exposure to each of the 
major principal components is small, the exposure to the shifts that have been observed in practice 
should also be small. 

© To keep this example simple, it is assumed that interest rates and dividend yields do not change. 
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Observe market prices for 
similar instruments that trade. 


Imply model parameters. 
Interpolate as appropriate. 


Value new instrument. 


Figure 25.1 How Models Are Usually Used for Valuing a Financial Instrument 


in other options (which can be expensive) and assumptions about how the volatility 
surface can change through time.’ As a result, volatility hedging cannot be anything near 
as effective as delta hedging. 

In practice, delta hedging is often the only hedging that a trader carries out on a 
regular basis. The delta of an option position does depend on the model used. This 
means that the hedge position taken by a trader varies from one model to the next. 
An important activity for the model validation group is to test the effectiveness of delta 
hedging for alternative models. 


25.4.3 P&L Decomposition 


One analysis that can be carried out is known as P&L decomposition. Risk managers divide 
day-to-day changes in the value of a portfolio into: 


1. A change resulting from risks that were unhedged 
2. A change resulting from the hedging model being imperfect 
3. A change resulting from new trades done during the day 


If the day-to-day change is unacceptable, the analysis will indicate the areas where more 
effort should be expended. The good news here is that, on average, the total gain or 
loss to a financial institution from inaccurate hedging is often small because it is largely 
diversified away across the portfolio of a large financial institution. 


25.5 Models for Less Actively Traded Products 


The general procedure for pricing derivatives is to build a model and then imply the 
parameters of the model from the prices that can be observed in the market. This is 
illustrated in Figure 25.1. One or more other financial instruments whose prices can be 


7If perfect hedging against volatility surface movements were possible, differences in delta hedging 
would be offset by differences in volatility hedging. But volatility hedging is in practice far from 
perfect. 
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observed in the market are chosen as the “calibrating instruments.” These instruments 
are chosen so that they are as similar as possible to the financial instrument of interest. 
They are then used to imply parameters for the model being used for the instrument 
under consideration. 

We have argued that there is very little model risk as far as valuation is concerned 
when this approach is used for standard instruments that trade actively. The same is not 
true for non-standard instruments. Here the model used is likely to have a significant 
effect on valuations as well as on hedging. An example of model risk in the pricing of 
structured products is provided by Business Snapshot 6.1. This makes the point that the 
assumption that the BBB-rated tranche of an asset-backed security (ABS) is similar to a 
BBB bond is not a good one when tranches of an ABS CDO are being priced. 

An important point to note is that we are nearly always concerned with what might 
be termed relative valuation. The objective is to value the financial instrument consistently 
with other similar financial instruments whose prices can be observed. It is very rare to 
use a model to value a derivative in absolute terms. This is another difference between 
finance models and the models of physics. 

A financial institution should not rely on a single model for pricing non-standard 
products. Instead it should, whenever possible, use several different models. This leads to a 
price range for the instrument and a better understanding of the model risks being taken. 

Suppose that three different models give prices of $6 million, $7.5 million, and 
$8.5 million for a particular product that a financial institution is planning to sell to a 
client. Even if the financial institution believes that the first model is the best one and plans 
to use that model as its standard model for daily repricing and hedging, it should ensure 
that the price it charges the client is at least $8.5 million. Moreover, it should be con- 
servative about recognizing profits. If the product is sold for $9 million, it is tempting to 
recognize an immediate profit of $3 million ($9 million less the believed-to-be-accurate 
price of $6 million). However, this is overly aggressive. A better, more conservative, prac- 
tice is to put the $3 million into a reserve account and transfer it to profits slowly during 
the life of the product 


25.5.1 Weighted Monte Carlo Simulation 


When different models are being used, they should be calibrated to the prices of actively 
traded standard products that are observed in the market. For example, when a barrier 
option is being priced, the models tested should produce prices for regular call and put 
options that are equal to those observed in the market. Rather than testing a small number 
of different models, a sophisticated analysis can be attempted to answer the question: 
“What range of model prices is possible for models that price actively traded products 
correctly?” This analysis, if carried out successfully, will give a worst possible price and a 
best possible price for the non-standard instrument. One approach is known as weighted 


8 This is also likely to have sensible implications for the way bonuses are paid. 
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Monte Carlo simulation? This involves carrying out a Monte Carlo simulation for one 
model and then applying weights to the sampled paths. Constraints are placed on the 
weights so that standard instruments are priced correctly by the weighted Monte Carlo 
simulation. An optimization procedure is used to find the weights that produce maximum 
and minimum values for the non-standard derivative consistent with the constraints. 


25.6 Accounting 


Products that are held in the trading book must be valued daily. This process is referred 
to by traders as marking to market. The valuations are used by accountants to calculate 
quarterly financial statements. Accountants refer to marking to market as “fair-value 
accounting.” 

Accountants recognize that some models produce more reliable prices than others. 
The Financial Accounting Standards Board in the United States (FAS 157) and the Inter- 
national Accounting Standards Board in Europe (IAS 39) require banks to report three 
categories of valuations separately: 


Level 1 valuations: Valuations for instruments where there are quoted prices in active 
markets. 

Level 2 valuations: Valuations for instruments where there are quoted prices for similar 
instruments in active markets or quoted prices for the same instrument in markets 
that are not active. 

Level 3 valuations: Valuations for instruments where some modeling assumptions are 
required. (This is sometimes referred to as marking to model.) 


The valuations we discussed in Section 25.4 where the model is used as a sophis- 
ticated interpolation tool would be classified as Level 2 valuations. The valuations in 
Section 25.5 where the calibrating instruments are somewhat different from the instru- 
ment being valued would be classified as Level 3 valuations. 


25.7 What Makes a Successful Pricing Model? 


It might be argued that any model where the values of input parameters change from 
day to day is not helpful. Certainly this would be true for models in physics. But, as has 
been explained, models for valuing financial instruments are used in a quite different way 
from the models of physics. 

The Black—Scholes—Merton model is popular for three reasons: 


1. It can be used in the way we have described for interpolation in conjunction with a 
volatility surface, and ensures that prices are consistent with observed market prices. 


? See, for example, A. Elices and E. Gimenez, “Weighted Monte Carlo,” Risk (May 2006). 
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2. It is a communication tool. Traders often prefer quoting implied volatilities to quot- 
ing option prices, because implied volatilities are more stable than option prices and, 
as a result, their quotes do not change as frequently. When the price of the underlying 
asset changes or the interest rate changes, the option price changes but the implied 
volatility may not. 

3. The model is sufficiently simple that a trader can develop intuition about the model 
and use it to structure his or her thinking about option markets. 


Models in other areas that have these properties have also proved to be popular. 
Model developers and the model validation group should understand that the ultimate 
purpose of a model is often to enable the model user to understand the market and 
exercise his or her judgment more efficiently. When alternative models are compared, 
this is an important consideration and may lead to simpler models being preferred to 
more complex ones. 


25.8 Model Building Missteps 


The art of model building is to capture what is important for valuing and hedging an 
instrument without making the model more complex than it needs to be. Sometimes 
models have to be quite complex to capture the important features of a situation, but 
this is not always the case. 

One potential misstep in model building is over-fitting. Consider the problem posed 
by the volatility surface in Table 25.1. We can exactly match the volatility surface with 
a single model by extending the Black—Scholes—Merton model so that volatility is a 
complex function of the underlying asset price and time! This can be desirable for 
some applications, but the model can become difficult to use. We may find that some 
properties of the model are less reasonable than those of simpler models. In particular, 
the joint probability distribution of the asset price at two or more times might be less 
than ideal, and, in some circumstances, future volatility surfaces given by the model might 
look quite different from those observed in the market today.'! 


10 This is the implied volatility function model proposed by B. Dupire, “Pricing with a Smile,” Risk 7 
(February 1994): 18-20; E. Derman and I. Kani, “Riding on a Smile,” Risk (February 1994): 32-39; 
M. Rubinstein, “Implied Binomial Trees,” Journal of Finance 49, no. 3 July 1994): 771-818. 

11 Models that fit the volatility surface at all future times accurately price instruments where there is 
a payoff at one future time that is dependent only on the asset price at that time. However, they are 
liable to be less accurate for instruments such as barrier options and compound options that depend 
on the joint probability distribution of the asset price at two or more times. Hull and Suo find that 
the implied volatility function model works reasonably well for compound options, but is less accurate 
for barrier options. See J. Hull and W. Suo, “A Methodology for the Assessment of Model Risk and 
Its Application to the Implied Volatility Function Model,” Journal of Financial and Quantitative Analysis 
37, no. 2 June 2002): 297-318. 
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Another potential misstep in model building is over-parameterization. The Black- 
Scholes—Merton model can be extended to include features such as stochastic volatility 
or jumps in the asset price. This invariably introduces extra parameters that have to be 
estimated and makes the model more difficult to use. It is usually claimed that the param- 
eters in complex models are more stable than those in simpler models and do not have 
to be adjusted very much from day to day. This may be true, but we should remember 
that we are not dealing with physical processes. The parameters in a complex model may 
remain relatively constant for a period of time and then change, perhaps because there 
has been what economists refer to as a regime change. A financial institution may find 
that a more complicated model is an improvement over a simple model until there is a 
regime change. The more complicated model may not then have the flexibility to cope 
with changing market conditions.'* 

As we have already mentioned, simple models are often the most successful. Often 
users are skeptical of complex models because they are so-called black boxes and it is 
very difficult to develop intuition about them. In some situations, users’ skepticism is 
well founded for the reasons we have just mentioned. 


Summary 


Model risk is a form of operational risk. It is now recognized that it requires special 
attention. Regulators expect to see that a financial institution has comprehensive pro- 
cedures for developing, documenting, validating, and using models. Models should be 
reviewed periodically to ensure that they are working as originally intended. The model 
validation group should be independent of model developers and model users. It should 
provide a check on the work of the model developer and reach conclusions about the 
circumstances under which the model should be used. 

Models in finance are different from those used in the physical sciences, because they 
are ultimately models of human behavior. They can usually only provide an approx- 
imation to reality. Furthermore, from time to time there are regime shifts leading to 
fundamental shifts in the performance of models. 

Valuation models are often nothing more than sophisticated interpolation tools to 
ensure that one instrument is priced consistently with other similar instruments. As the 
instrument being valued becomes less standard, the valuation model chosen becomes 
more important. It is then a good practice for the model validation group to test the 
use of different models to obtain a realistic range for pricing and to understand the 
accompanying model risk. The models should always be calibrated to the current prices 
of actively traded instruments. The hedging performance of a model, as well as valuation 
performance, should be evaluated carefully. 


The nature of this type of problem in the social sciences is discussed in the famous Lucas Critique, 
which was propounded by the economist Robert Lucas in 1976. See R. Lucas, “Economic Policy 
Evaluation: A Critique,’ Carnegie-Rochester Conference Series on Public Policy 1 (1976): 19-46. 
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Practice Questions and Problems 
(Answers at End of Book) 


25.1 Explain what is meant by (a) marking to market and (b) marking to model. 
25.2 What are the two categories of model risk identified in SR 11-7? 
25.3 Why is it important that a model be fully documented? 
25.4 “The Black—Scholes—Merton model is nothing more than a sophisticated inter- 
polation tool.” Discuss this viewpoint. 
25.5 Using Table 25.1, calculate the volatility a trader would use for an eight-month 
option with a strike price of 1.04. 
25.6 What is the key difference between the models of physics and the models of 
finance? 
25.7 How is a financial institution likely to find that it is using a model different from 
its competitors for valuing a particular type of derivatives product? 
25.8 Distinguish between within-model and outside-model hedging. 
25.9 What is meant by P&L decomposition? 
25.10 What is meant by “Level 1,” “Level 2,” and “Level 3” valuations in accounting? 
25.11 What is meant by over-fitting and over-parameterization in model building? 
25.12 Section 25.4 calculates the implied volatility of an option with strike price 0.92 
and time to maturity 1.5 years using the data in Table 25.1 as 14.59%. It does this 
by interpolating first along the strike price dimension and then along the time to 
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25.13 


25.14 


25.15 


25.16 


maturity dimension. Show that the same answer is obtained by interpolating first 
along the time to maturity dimension and then along the strike price dimension. 


“For structured products, traders mark to model. They do not mark to market’ 
Explain this remark. 


Further Questions 


Suppose that all options traders decide to switch from Black—Scholes—Merton to 
another model that makes different assumptions about the behavior of asset prices. 
What effect do you think this would have on (a) the pricing of standard options 
and (b) the hedging of standard options? 

Using Table 25.1, calculate the volatility a trader would use for an 11-month 
option with a strike price of 0.98. 

Suppose that a financial institution uses an imprecise model for pricing and hedg- 
ing a particular type of structured product. Discuss how, if at all, it is likely to 
realize its mistake. 


Chapter 26 


Economic Capital and 


RAROC 


p to now, we have focused on the development of procedures for evaluating 

different components of a financial institution’s risk (credit risk, market risk, 

operational risk, and so on). We now consider how risks can be aggregated 
and allocated to different business units. 

Economic capital (sometimes referred to as risk capital) is a financial institution’s own 
internal estimate of the capital it needs for the risks it is taking. It is different from 
regulatory capital, which in the case of banks is based on one-size-fits-all rules determined 
by the Basel Committee. Economic capital can be regarded as a “currency” for risk- 
taking within a financial institution. A business unit can take a certain risk only when it 
is allocated the appropriate economic capital for that risk. The profitability of a business 
unit is measured relative to the economic capital allocated to the unit. 

In this chapter, we discuss the approaches a financial institution uses to arrive at 
estimates of economic capital for particular risk types and particular business units, and 
how these estimates are aggregated to produce a single economic capital estimate for the 
whole financial institution. We also discuss risk-adjusted return on capital, or RAROC. 
This is the return earned by a business unit on the capital assigned to it. RAROC can 
be used to assess the past performance of business units. It can also be used to forecast 
future performance of the units and decide on the most appropriate way of allocating 
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capital in the future. It provides a basis for determining whether some activities should 
be discontinued and others expanded. 


26.1 Definition of Economic Capital 


Economic capital is usually defined as the amount of capital a financial institution needs 
in order to absorb losses over one year with a certain confidence level. The confidence 
level is therefore the probability that the bank’s losses will not exceed the economic cap- 
ital in one year. The confidence level depends on the financial institution’s objectives. A 
common objective for a large international bank is to maintain a AA credit rating. Cor- 
porations rated AA have a one-year probability of default of about 0.02%. This suggests 
that the confidence level should be set as high as 99.98% for economic capital to be a 
guide as to what is necessary to maintain a AA rating. The probability that capital will 
be insufficient to absorb losses so that a default results is then the required 0.02%. For 
a bank wanting to maintain a BBB credit rating, the confidence level can be lower. A 
BBB-rated corporation has a probability of about 0.2% of defaulting in one year, so that 
a confidence level of 99.8% gives the capital necessary to maintain a BBB rating. 

Capital is required to cover unexpected loss. This is defined as the difference between 
the actual loss and the expected loss. As explained in Chapter 15, the idea here is that 
expected losses should be taken account of in the way a financial institution prices its 
products so that only unexpected losses require capital. As indicated in Figure 26.1, the 
economic capital is the difference between expected loss and the X percentile point on 
the probability distribution of the loss, where X% is the confidence level. 


Expected X 
loss percentile 
le > 
Capital 


Loss over 
one year 


Figure 26.1 Calculation of Economic Capital from One-Year Loss Distribution 
X% is the confidence level. 
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Example 26.1 


Suppose that the main business of an AA-rated bank is lending in a certain region of the 
world. The bank estimates its losses as 1% of outstanding loan principal per year on aver- 
age. The one-year 99.98% VaR is estimated as 5% of the outstanding loan principal. The 
economic capital required per $100 of loans made is therefore $4. (This is the difference 
between the one year 99.98% VaR and the expected loss.) 


26.1.1 Approaches to Measurement 


There are two broad approaches to measuring economic capital: the “top-down” and 
“bottom-up” approaches. In the top-down approach, the volatility of the financial insti- 
tution’s assets is estimated and then used to calculate the probability that the value of 
the assets will fall below the value of the liabilities by the end of the time horizon. A 
theoretical framework that can be used for the top-down approach is Merton’s model, 
which was discussed in Section 19.8. 

The approach most often used is the bottom-up approach where loss distributions 
are estimated for different types of risk and different business units and then aggregated. 
The first step in the aggregation can be to calculate probability distributions for losses by 
risk type or losses by business unit. A final aggregation gives a probability distribution of 
total losses for the whole financial institution. 

The various risks facing a bank or other financial institution are summarized in 
Figure 26.2. As we saw in Chapter 23, bank regulators have chosen to define operational 
risk as: “The risk of loss resulting from inadequate or failed internal processes, people, 
and systems or from external events.” Operational risk includes model risk and legal risk, 
but it does not include risk arising from strategic decisions and reputational risk. We will 
collectively refer to the latter risks as business risk. Regulatory capital is not required for 
business risk, but some banks do assess economic capital for it. 


Total risk | 
Non-business risk Business risk 
(regulatory capital) (no regulatory capital) 
Credit risk Risk from strategic 
Market risk decisions 
Operational risk Reputation risk 


Figure 26.2 Categorization of Risks Faced by a Bank and Regulatory Capital Requirements 
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26.2 Components of Economic Capital 


In earlier chapters, we covered approaches used to calculate loss distributions for different 
types of risk. This section reviews some of the key issues. 


26.2.1 Market Risk Economic Capital 


In Chapters 13 and 14, we discussed the historical simulation and model-building 
approaches for estimating the probability distribution of losses or gains from market risk. 
As explained, this distribution is usually calculated in the first instance with a one-day 
time horizon. Regulatory capital for market risk in Basel I and Basel II.5 is calculated 
from 10-day 99% VaR estimates. In FRTB, as described in Chapter 18, ES is used with 
time horizons determined by liquidity. 

When calculating economic capital, we usually want to use the same time horizon 
and confidence level for all risks. The time horizon is usually one year and, as already 
discussed, the confidence level can be as high as 99.98%. Simple assumptions are (a) that 
the probability distributions of gains and losses for each day during the year are the same 
and (b) that the distributions are independent. The one-year loss or gain distribution is 
then approximately normal. Assuming 252 business days in the year, the standard devi- 
ation of the one-year loss or gain equals the standard deviation of the daily loss or gain 
multiplied by 4/252. If the average loss is zero, the 99.98% VaR is then 3.54 times the 
standard deviation of the one-year loss or gain. The 99.8% VaR is 2.88 times the standard 
deviation of the one-year loss or gain. Note that we are not assuming that the daily losses 
or gains are normal. All we are assuming is that they are independent and identically dis- 
tributed. The central limit theorem of statistics tells us that the sum of many independent 
identically distributed variables is approximately normal. 


Example 26.2 


Suppose that the one-day standard deviation of market risk losses or gains for a bank is 
$5 million. The one-year 99.8% VaR is 2.88 X 1252 x 5 = 228.6 or $228.6 million. 


In practice, the losses (gains) on different days are not perfectly independent and 
identically distributed. The independence assumption can be relaxed by assuming a 
constant autocorrelation between the returns on successive days. This allows the result 
in equation (12.5) to be used and, providing the autocorrelation is not too large, the 
loss (gain) in a year can still be assumed to be approximately normal. Sometimes, it 
may be appropriate to develop a more elaborate model of how the standard devia- 
tion of the daily loss (gain) evolves through time. Monte Carlo simulation can then 
be used to aggregate the daily distributions and the result may not then be a normal 
distribution. 
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26.2.2 Credit Risk Economic Capital 


Although Basel II gives banks that use the internal-ratings-based approach for regulatory 
capital a great deal of freedom, it does not allow them to choose their own credit corre- 
lation model and correlation parameters. When calculating economic capital, banks are 
free to make the assumptions they consider most appropriate for their situation. Credit- 
Metrics, which is explained in Section 21.4, is a popular approach for calculating credit 
risk economic capital. 

Another approach that has been proposed is Credit Risk Plus, which is described in 
Section 21.3. This approach borrows a number of ideas from actuarial science to calculate 
a probability distribution for losses from defaults. Whereas CreditMetrics calculates the 
loss from downgrades and defaults, Credit Rusk Plus calculates losses from defaults only. 

In calculating credit risk economic capital, a financial institution can choose to 
adopt a conditional or unconditional model. In a conditional (cycle-specific) model, 
the expected and unexpected losses take account of current economic conditions. In an 
unconditional (cycle-neutral) model, they are calculated by assuming economic condi- 
tions that are in some sense an average of those experienced through the cycle. Rating 
agencies aim to produce ratings that are unconditional. In addition, when regulatory cap- 
ital is calculated using the internal-ratings-based approach, the PD and LGD estimates 
should be unconditional. Obviously, it is important to be consistent when economic 
capital is calculated. If expected losses are conditional, unexpected losses should also be 
conditional. If expected losses are unconditional, the same should be true of unexpected 
losses. 

Whatever the approach used, a Monte Carlo simulation is usually necessary to cal- 
culate the probability distribution of credit losses. As explained in Chapters 20 and 21, 
derivatives complicate the calculations because of the uncertainty about the exposure at 
the time of a default or downgrade. 


26.2.3 Operational Risk Economic Capital 


As explained in Chapter 23, bank regulators are switching from the Advanced Measure- 
ment Approach (AMA) for determining operational risk regulatory capital to a simpler 
approach based on a bank’s loss history. It seems likely that some sophisticated banks will 
continue to use the AMA for determining economic capital as it is based on the model 
in Figure 26.1. 


26.2.4 Business Risk Economic Capital 


As mentioned earlier, business risk includes strategic risk (relating to a bank’s decision 
to enter new markets and develop new products) and reputational risk. Business risk is 
even more difficult to quantify than operational risk and estimates are likely to be largely 
subjective. However, attempting to quantify business risk can be a useful exercise, as it 
can lead to risk managers within a financial institution acquiring a good understanding 
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Gain Loss 


Figure 26.3 Loss Probability Density Function for Market Risk 


Gain Loss 


Figure 26.4 Loss Probability Density Function for Credit Risk 


of the portfolio of business risks being taken. This enables them to assess the marginal 
impact on total risk of new strategic initiatives that are being contemplated. 


26.3 Shapes of the Loss Distributions 


The loss probability distributions for market, credit, and operational risk are very dif- 
ferent. Rosenberg and Schuermann (2004) used data from a variety of different sources 
to estimate typical shapes for these distributions! These are shown in Figures 26.3 to 
26.5. The market risk loss distribution (see Figure 26.3) is symmetrical but not perfectly 
normally distributed. A t-distribution with 11 degrees of freedom provides a good fit. 
The credit risk loss distribution in Figure 26.4 is quite skewed, as one would expect. The 
operational risk distribution in Figure 26.5 has a quite extreme shape. Most of the time, 
losses are modest, but occasionally they are very large. 


1 See J. V. Rosenberg and T. Schuermann, “A General Approach to Integrated Risk Management with 
Skewed, Fat-Tailed Risks,” Federal Reserve Bank of New York, Staff Report no. 185, May 2004. 
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Gain Loss 


Figure 26.5 Loss Probability Density Function for Operational Risk 


We can characterize a distribution by its second, third, and fourth moments. Loosely 
speaking, the second moment measures standard deviation (or variance), the third 
moment measures skewness, and the fourth moment measures kurtosis (i.e., the heavi- 
ness of tails). Table 26.1 summarizes the properties of typical loss distributions for market, 
credit, and operational risk. 


26.4 Relative Importance of Risks 


The relative importance of different types of risk depends on the business mix. Credit risk 
is important in commercial lending, retail lending, and a financial institution’s derivatives 
business. Market risk is important in trading and some investment banking activities. 
But as mentioned in Chapter 23, operational risk (particularly cyber risk, legal risk, and 
compliance risk) is now considered by many observers to be the most important risk for 
financial institutions. 

Operational risk has always been considered to be the most important risk in asset 
management. If the asset manager is found to be negligent in some way, there are liable to 
be expensive investor lawsuits. Business Snapshot 26.1 gives one example of this. Another 
high-profile example is provided by Unilever’s pension plan. Mercury Asset Manage- 
ment, owned by Merrill Lynch, pledged not to underperform a benchmark index by 
more than 3%. Between January 1997 and March 1998, it underperformed the index by 
10.5%. Unilever sued Merrill Lynch for $185 million and the matter was settled out of 


Table 26.1 Characteristics of Loss Distributions for Different Risk Types 


Second Moment Third Moment Fourth Moment 
(standard deviation) (skewness) (kurtosis) 
Market risk High Zero Low 
Credit risk Moderate Moderate Moderate 


Operational risk Low High High 
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BUSINESS SNAPSHOT 26.1 
The EGT Fund 


In 1996, Peter Young was fund manager at Deutsche Morgan Grenfell, a sub- 
sidiary of Deutsche Bank. He was responsible for managing a fund called the 
European Growth Trust (EGT). It had grown to be a very large fund and Young 
had responsibilities for managing over one billion pounds of investors’ money. 

Certain rules applied to EGT. One of these was that no more than 10% of 
the fund could be invested in unlisted securities. Peter Young violated this rule 
in a way that it can be argued benefited him personally. When the facts were 
uncovered, he was fired and Deutsche Bank had to compensate investors. The 
total cost to Deutsche Bank was over 200 million pounds. 


court. The Spanish bank Santander incurred operational risk losses in 2009 on funds it 
managed for investors and placed with Bernard Madoff, who, it transpired, was running a 
$50 billion Ponzi scheme. 


26.4.1 Interactions between Risks 


There are interactions between the different types of risks. If a financial institution’s 
counterparty defaults, credit risk exists only if market variables have moved so that the 
uncollateralized value of the derivative to the financial institution is positive. Another 
interaction is that the probability of default by a counterparty may depend on the value 
ofa financial institution’s contract (or contracts) with the counterparty. This is the wrong- 
way issue discussed in Chapter 20. If the counterparty has entered into the contract for 
hedging purposes, the dependence should be small. However, if the contract has been 
entered into for speculative purposes and the contract is large in relation to the size of 
the counterparty, the dependence is likely to be important. 

As the Long-Term Capital Management saga clearly shows, there can be interactions 
between liquidity risks and market risks (see Business Snapshot 22.1). There are also 
interactions between operational risks and market risks. It is unlikely that we would 
know about the activities of Jérôme Kerviel at Société Générale if he had guessed right 
about market movements (see Business Snapshot 5.5). Similarly, we are unlikely to hear 
about a violation of the rules for a fund (such as the one in Business Snapshot 26.1) if 
the violation leads to a gain rather than a loss. 


26.5 Aggregating Economic Capital 


A financial institution typically calculates market, credit, operational, and (possibly) busi- 
ness risk loss distributions for a number of different business units. It is then faced with 
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the problem of aggregating the loss distributions to calculate a total economic capital for 
the whole enterprise. 

The simplest approach is to assume that the total economic capital for a set of n differ- 
ent risks is the sum of the economic capital amounts for each risk considered separately, 
so that 


Eon = » E; (26.1) 
i=1 


where E 


tota] 18 the total economic capital for the financial institution facing n different 


risks and E; is the economic capital for the ith risk considered on its own. This is what 
the Basel Committee does for regulatory capital. The total regulatory capital a bank is 
required to keep is the sum of the regulatory capital amounts for credit, market, and 
operational risks. 

Equation (26.1) is clearly a very conservative assumption. It assumes perfect corre- 
lation. In the context of economic capital calculations where the confidence level is, 
say, 99.9%, it would mean that, if the market risk loss for a financial institution is in the 
extreme 0.1% tail of the distribution, the same is true for the credit risk loss and the oper- 
ational risk loss. Rosenberg and Schuermann estimate the correlation between market 
risk and credit risk to be approximately 50% and the correlation between each of these 
risks and operational risk to be approximately 20%. They estimate that equation (26.1) 
when used as a way of aggregating market, credit, and operational risks overstates the 
total capital required by about 40%. 


26.5.1 Assuming Normal Distributions 


A simple assumption when aggregating loss distributions is that they are normally dis- 
tributed. The standard deviation of the total loss from n sources of risk is then 


(26.2) 


where ©, is the standard deviation of the loss from the ith source of risk and p; is the 
RN between risk i and risk j. The capital requirement can be calculated from 
this. For example, the excess of the 99.9% VaR over the expected loss is 3.09 times the 
number calculated in equation (26.2). 

This approach tends to underestimate the capital requirement because it takes no 
account of the skewness and kurtosis of the loss distributions. Rosenberg and Schuer- 
mann estimate that, when the approach is applied to aggregating market, credit, and 
operational risks, the total capital is underestimated by about 40%. 
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26.5.2 Using Copulas 


A more sophisticated approach to aggregating loss distributions is by using copulas. Cop- 
ulas were discussed in Chapter 11. Each loss distribution is mapped on a percentile to 
percentile basis to a standard well-behaved distribution. A correlation structure between 
the standard distributions is defined and this indirectly defines a correlation structure 
between the original distributions. 

Many different copulas can be defined. The simplest is the Gaussian copula, where 
the standard distributions are assumed to be multivariate normal. Other alternatives, with 
more tail correlation, are explained in Section 11.6. 


26.5.3 The Hybrid Approach 


The approach in Section 12.9 for aggregating VaR is a popular approach for aggregating 
economic capital estimates. It is sometimes referred to as the hybrid approach. It involves 
calculating the economic capital for a portfolio of risks from the economic capital for 
the individual risks using 


(26.3) 


When the distributions are normal, this approach is exactly correct. When distri- 
butions are non-normal, the hybrid approach gives an approximate answer—but one 
that reflects any heaviness in the tails of the individual loss distributions. Rosenberg and 
Schuermann find that the answers given by the hybrid approach are reasonably close to 
those given by copula models. 


Example 26.3 


Suppose that the estimates for economic capital for market, credit, and operational risk 
for two business units are as shown in Table 26.2. The correlations between the losses 
are shown in Table 26.3. The correlation between credit risk and market risk within the 
same business unit is 0.5, and the correlation between operational risk and either credit 
or market risk within the same business unit is 0.2. (These correspond to the estimates of 
Rosenberg and Schuermann mentioned earlier.) The correlation between two different 


Table 26.2 Economic Capital Estimates for Example 26.3 


Business Units 


1 2 
Market Risk 30 40 
Credit Risk 70 80 


Operational Risk 30 90 
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Table 26.3 Correlations between Losses in Example 26.3 


MR-1 CR-1 OR-1 MR-2 CR-2 OR-2 
MR-1 1.0 0.5 0.2 0.4 0.0 0.0 
CR-1 0.5 1.0 0.2 0.0 0.6 0.0 
OR-1 0.2 0.2 1.0 0.0 0.0 0.0 
MR-2 0.4 0.0 0.0 1.0 0.5 0.2 
CR-2 0.0 0.6 0.0 0.5 1.0 0.2 
OR-2 0.0 0.0 0.0 0.2 0.2 1.0 


MR, CR, and OR refer to market risk, credit risk, and operational risk; 1 and 2 refer to business units. 

risk types in two different business units is zero. The correlation between market risks 
across business units is 0.4. The correlation between credit risks across business units is 
0.6. The correlation between operational risks across business units is zero. 


We can aggregate the economic capital using the hybrid approach. The total market 
risk economic capital is 


V 307 + 402 + 2 x 0.4 x 30 x 40 = 58.8 


The total credit risk economic capital is 


V 702 + 802 + 2 X 0.6 x 70 X 80 = 134.2 


The total operational risk economic capital is 


V302 + 902 = 94.9 


The total economic capital for business unit 1 is 


V 302 +707 + 302 + 2 x 0.5 X 30X70 + 2 x 0.2 x 30 X 30 + 2 x 0.2 x 70 x 30 = 100.0 


The total economic capital for business unit 2 is 


V 402 + 802 + 902 + 2 x 0.5 x 40 x 80 + 2 x 0.2 x 40 X 90 + 2 X 0.2 x 80 x 90 = 153.7 


The total enterprise-wide economic capital is the square root of 
30? + 40? + 70? + 80? + 307 + 90? + 2 x 0.4 x 30 x 40 + 2 x 0.5 x 30 x 70 
+2 X 0.2 x 30 X 30 + 2 x 0.5 x 40 x 80 + 2 x 0.2 x 40 x 90 
+2 X 0.6 X 70 X 80 + 2 X 0.2 x 70 x 30 + 2 x 0.2 x 80 x 90 


or 203.2. 
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There are clear diversification benefits. The sum of the economic capital estimates 
for market, credit, and operational risk is 58.8 + 134.2 + 94.9 = 287.9 and the sum of 
the economic capital estimates for two business units is 100 + 153.7 = 253.7. Both of 
these are greater than the total economic capital estimate of 203.2. 


26.6 Allocation of Economic Capital 


Suppose that the sum of the economic capital for the business units in a bank, X E,, is 
$2 billion and that the total economic capital for the whole bank, after taking less-than- 
perfect correlations into account, is $1.3 billion (= 65% of the sum of the E). The $0.7 
billion is a diversification gain to the bank. How should the total economic capital be 
allocated to the business units? 

A simple approach is to allocate 0.65£E; to business unit i. However, this is probably 
not the best approach. Consider a situation where there are 50 business units and that two 
particular units both have an economic capital of $100 million. Suppose that, when the 
first business unit is excluded from the calculations, the bank’s economic capital decreases 
by $60 million and that, when the second business unit is excluded from the calculation, 
the bank’s economic capital decreases by $10 million. Arguably, the first business unit 
should have more economic capital than the second, because its incremental impact on 
the bank’s total economic capital is greater. 

The issues here are analogous to those we discussed in Section 12.8 for allocating 
VaR. One approach is to calculate incremental economic capital for each business unit 
and then allocate economic capital to business units in proportion to their incremental 
capital. (Incremental capital is the difference between the total economic capital with 
and without the business unit.) A popular approach is to work with the component 
economic capital. This involves allocating 


OB eal 
x — 


' Ox; 
to the ith business unit, where x; is the investment in the ith business unit. As we pointed 
out in Section 12.8, a result known as Euler’s theorem then ensures that the total of the 
allocated capital is the total economic capital E,,..1- 
Define Q, as the increase in the total economic capital when we increase x; by a small 
amount Ax;,. A discrete approximation for the component economic capital for business 
unit 7 1s 


Qi (26.4) 


Yi 


where y; = Ax;/x;. 
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Example 26.4 


Consider again Example 26.3. The total economic capital is 203.2. The economic capital 
calculated for Business Unit 1 is 100 and that calculated for Business Unit 2 is 153.7. 

A simple approach would allocate 100/253.7 of the total economic capital to Busi- 
ness Unit 1 and 153.7/253.7 of the economic capital to Business Unit 2. This would 
result in 80.1 for Business Unit 1 and 123.1 to Business Unit 2. 

The incremental effect of Business Unit 1 on the total economic capital is 203.2 — 
153.7 = 49.5. Similarly, the incremental effect of Business Unit 2 on the total economic 
capital is 203.2 — 100 = 103.2. The two incremental capitals do not add up to the total 
capital (as is usually the case). However, we could use them as a basis for allocating the 
total capital. We would then allocate 49.5/(49.5 + 103.2) of the capital to Business Unit 1 
and 103.2/(49.5 + 103.2) of it to Business Unit 2. This would result in 65.9 for Business 
Unit 1 and 137.3 for Business Unit 2. 

To apply equation (26.4) and allocate component economic capital to each unit, we 
could calculate the partial derivative analytically. Alternatively, we can use a numerical 
calculation. When we increase the size of Business Unit 1 by 1%, its economic capital 
amounts for market, credit, and operational risk in Table 26.2 increase to 30.3, 70.7, and 
30.3, respectively. The total economic capital becomes 203.906 so that Q, = 203.906 — 
203.224 = 0.682. 

When we increase the size of Business Unit 2 by 1%, its economic capital amounts for 
market, credit, and operational risk in Table 26.2 increase to 40.4, 80.8, and 90.9, respec- 
tively. The total economic capital becomes 204.577 so that Q, = 204.577 — 203.224 = 
1.353. 

In this case, because we are considering 1% increases in the size of each unit, y; = 
yə = 0.01. From equation (26.4), the economic capital allocations to the two business 
units are 68.2 and 135.3. (These do not quite add up to the total economic capital of 
203.2 because we approximated the partial derivative.) 


26.7 Deutsche Bank’s Economic Capital 


Deutsche Bank publishes the results of its economic capital calculations in its annual 
financial statements. Table 26.4 summarizes the economic capital and regulatory capital 
reported at the end of 2016. Capital is calculated for credit risk, market risk, operational 
risk, and business risk. Deutsche Bank calculated a diversification benefit reflecting the 
lack of perfect correlation between the different risk types. The total economic capital 
was about 35.4 billion euros. 

The actual capital held was about 42.3 billion euros of core Tier 1 capital (i.e., com- 
mon equity), 4.6 billion euros of additional Tier 1 capital, and about 12.7 billion euros of 
Tier 2 capital. Table 26.4 shows these capital amounts as a percentage of risk-weighted 
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Table 26.4 Deutsche Bank’s Economic Capital and Regulatory Capital, December 2016 
(millions of euros) 


Credit risk 13,105 


Market risk 14,593 
Operational risk 10,488 
Business risk 5,098 
Diversification benefit (7,846) 
Total economic capital 35,438 
Total risk-weighted assets 357,518 
Common Equity Tier 1 capital (% of risk-weighted assets) 11.8% 
All Tier 1 capital (% of risk-weighted assets) 13.1% 
Total (Tier 1 plus Tier 2) capital (% of risk-weighted assets) 16.6% 


assets. Deutsche Bank exceeds the capital requirements in Basel III, including the G-SIB 
additional capital, which in the case of Deutsche Bank was 2% of risk-weighted assets. 

Table 26.5 shows how this capital was allocated to business units. The largest user of 
economic capital by far was Global Markets, which Deutsche Bank states is dominated 
by trading in support of origination, structuring, and market making. Credit risk, market 
risk, operational risk, and business risk were all greatest for this division. 


26.8 RAROC 


Risk-adjusted performance measurement (RAPM) has become an important part of how 
business units are assessed. There are many different approaches, but all have one thing 
in common. They compare return with capital employed in a way that incorporates an 
adjustment for risk. 

The most common approach is to compare expected return with economic capital. 
This is usually referred to as RAROC (risk-adjusted return on capital). The formula is 


Revenues — Costs — Expected losses 
RAROC = FT _—_—— (26.5) 


Economic capital 


Table 26.5 Allocation of Deutsche Bank’s Economic Capital to Divisions, December 
2016 (millions of euros) 


Division Allocated Economic Capital 
Global Markets 15,587 
Corporate and Investment Banking 5,015 
Private Wealth and Commercial Clients 2,473 
Deutsche Asset Management 2,480 
Postbank 3,976 
Non-Core Operations Unit 735 
Consolidations and Adjustments and Other 5,172 


Total 35,438 
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The numerator may be calculated on a pre-tax or post-tax basis. Sometimes, a risk- 
free rate of return on the economic capital is calculated and added to the numerator. 


Example 26.5 


A bank rated AA estimates its losses on loans as 1% of the outstanding principal on aver- 
age. The 99.98% VaR (i.e., the loss exceeded only 0.02% of the time) is 5% of outstanding 
loans. As indicated in Example 26.1, the economic capital required per $100 of loans is 
$4, which is the difference between the 99.98% VaR and the expected loss. (This ignores 
diversification benefits that would in practice be allocated to the lending unit as a result 
of less than perfect correlation with the bank’s other activities.) The spread between the 
cost of funds and the interest charged is 2.5%. Subtracting from this the expected loan loss 
of 1%, the expected contribution per $100 of loans is $1.50. Assuming that the lending 
unit’s administrative costs total 0.7% of the amount loaned, the expected profit is reduced 
to $0.80 per $100 in the loan portfolio. RAROC is therefore 


0.80 _ soz 
4 


An alternative calculation would add the interest on the economic capital to the 
numerator. Suppose that the risk-free interest rate is 2%. Then 0.02 X 4 = 0.08 is added 
to the numerator so that RAROC becomes 


0.88 _ 599 
4 


As pointed out by Matten (2000), it is more accurate to refer to the approach in equa- 
tion (26.5) as RORAC (return on risk-adjusted capital) rather than RAROC, because it 
is the capital (i.e., the denominator) not the return that reflects risk? In theory, RAROC 
should involve adjusting the return (i.e., the numerator) for risk. In practice, RAROC is 
used to denote a wide range of return on capital calculations. 

RAROC can be calculated ex-ante (before the start of the year) or ex-post (after the 
end of the year). Ex-ante calculations are based on estimates of expected profit. Ex-post 
calculations are based on actual profit results. Ex-ante calculations are typically used to 
decide whether a particular business unit should be expanded or contracted. Ex-post 
calculations are typically used for performance evaluation and bonus calculations. 

It is usually not appropriate to base a decision to expand or contract a particular 
business unit on an ex-post analysis (although there is a natural temptation to do this). 
It may be that results were bad for the most recent year because of a one-time event 


?See C. Matten, Managing Bank Capital: Capital Allocation and Performance Measurement, 2nd ed. (Chi- 
chester, UK: John Wiley & Sons, 2000). 
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(e.g.,an unusually high credit loss or a particularly high operational risk loss). Key strategic 
decisions should be based on expected long-term results. 


Summary 


Economic capital is the capital that a bank or other financial institution deems nec- 
essary for the risks it is bearing. When calculating economic capital, a financial 
institution is free to adopt any approach it likes. It does not have to use the one pro- 
posed by regulators. Typically, it estimates economic capital for credit risk, market risk, 
operational risk, and (possibly) business risk for its business units and then aggregates the 
estimates to produce an estimate of the economic capital for the whole enterprise. The 
risks are usually assumed to be less than perfectly correlated. The benefits of diversifica- 
tion are estimated and allocated to business units. Usually the approach used is designed 
to reflect the incremental impact of the business units on the total economic capital. 

The one-year loss distributions for market risk, credit risk, and operational risk are 
quite different. The loss distribution for market risk is symmetrical. For credit risk it is 
skewed, and for operational risk it is highly skewed with very heavy tails. 

Financial institutions calculate RAROC, the risk adjusted return on economic cap- 
ital, for each business unit by dividing the profit of the business unit by its allocated 
economic capital. The expected future RAROC is used to decide which areas of the 
business should be expanded or contracted. The actual RAROC achieved by a business 
unit is used for performance evaluation. 


Further Reading 


Dev, A. Economic Capital: A Practitioners Guide. London: Risk Books, 2004. 

Matten, C. Managing Bank Capital: Capital Allocation and Performance Measurement. 2nd ed. Chichester, 
UK: John Wiley & Sons, 2000. 

Rosenberg, J. V, and T. Schuermann. “A General Approach to Integrated Risk Management with 
Skewed, Fat-Tailed Risks.” Federal Reserve Bank of New York, Staff Report no. 185, May 2004. 


Practice Questions and Problems 
(Answers at End of Book) 


26.1 What is the difference between economic capital and regulatory capital? 

26.2 What determines the confidence level used by a AA-rated financial institution in 
its economic capital calculations? 

26.3 What types of risk are included in business risk? 

26.4 In what respects are the models used to calculate economic capital for market risk, 
credit risk, and operational risk likely to be different from those used to calculate 
regulatory capital? 
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26.5 


26.6 


26.7 


26.8 


26.9 


26.10 


26.11 


Suppose that the credit loss in a year has a lognormal distribution. The logarithm 
of the loss is normal with mean 0.5 and standard deviation 4. What is the economic 
capital requirement if a confidence level of 99.97% is used? 

Suppose that the economic capital estimates for two business units are as follows: 


Business Units 


1 2 
Market Risk 20 40 
Credit Risk 40 30 
Operational Risk 70 10 


The correlations are as in Table 26.3. Calculate the total economic capital for 
each business unit and the two business units together. 
In Problem 26.6, what is the incremental effect of each business unit on the total 
economic capital? Use this to allocate economic capital to business units. What 
is the impact on the economic capital of each business unit increasing by 0.5%? 
Show that your results are consistent with Euler’s theorem. 
A bank is considering expanding its asset management operations. The main risk 
is Operational risk. It estimates the expected operational risk loss from the new 
venture in one year to be $2 million and the 99.97% VaR (arising from a small 
chance of an large investor lawsuit) to be $40 million. The expected fees it will 
receive from investors for the funds under administration are $12 million per 
year, and administrative costs are expected to be $3 million per year. Estimate the 
before-tax RAROC. 
RAROC can be used in two different ways. What are they? 


Further Questions 


Suppose that daily gains (losses) are normally distributed with a standard deviation 

of $5 million. 

(a) Estimate the minimum regulatory capital the bank is required to hold. (Assume 
a multiplicative factor of 4.0.) 

(b) Estimate the economic capital using a one-year time horizon and a 99.9% 
confidence level assuming that there is a correlation of 0.05 between gains 
(losses) on successive days. 

Suppose that the economic capital estimates for two business units are 


Business Units 


1 2 
Market Risk 10 50 
Credit Risk 30 30 


Operational Risk 50 10 
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26.12 


26.13 


The correlation between market risk and credit risk in the same business unit 

is 0.3. The correlation between credit risk in one business unit and credit risk 
in another is 0.7. The correlation between market risk in one business unit and 
market risk in the other is 0.2. All other correlations are zero. Calculate the total 
economic capital. How much should be allocated to each business unit? 
Suppose that a bank’s sole business is to lend in two regions of the world. The 
lending in each region has the same characteristics as in Example 26.5 of Sec- 
tion 26.8. Lending to Region A is three times as great as lending to Region B. 
The correlation between loan losses in the two regions is 0.4. Estimate the total 
RAROC. 
Suppose daily losses (gains) from trading are independent and normally distributed 
with mean zero. Calculate in terms of the standard deviation of the daily losses 
(gains) (a) the basic Basel I regulatory capital requirement calculated as 3 times the 
10-day 99% VaR and (b) the economic capital calculated using a 99.97% confi- 
dence level and a one-year time horizon. Would you expect the economic and 
regulatory capital to become closer together or farther apart if daily losses/gains 
are generated by a distribution with much heavier tails than the normal distribu- 
tion? What would you expect to be the impact of the daily losses/gains exhibiting 
positive autocorrelation? 


Chapter 27 


Enterprise Risk Management 


uch of this book has focused on understanding and quantifying different 
M types of risk: market risk, credit risk, operational risk, and so on. An important 

part ofa risk manager’s job is to ensure that these risks are correctly evaluated. 
But it is also important for the risk management function to take a holistic, big-picture 
view of risk. It should identify potential adverse events and their full consequences. The 
total exposure to an adverse event can be greater than (or less than) the result obtained 
by considering each risk type separately. Enterprise risk management (ERM) is the name 
given to this holistic approach to risk management. 

In understanding ERM, it is important to distinguish top-down and bottom-up 
approaches to risk management. Bottom-up approaches are concerned with assessing 
the different types of risk borne by different business units and combining them. We dis- 
cussed how this can be done in the previous chapter. In top-down approaches, the overall 
risk appetite of the organization is defined, and this is then used to define risk limits for 
different parts of the organization. In practice, a financial institution needs to use both 
top-down and bottom-up approaches. A top-down approach is necessary to define the 
overall risk appetite, and a bottom-up approach is necessary to evaluate whether the risks 
being taken by business units are consistent with this risk appetite. 

In 2004, the Committee of Sponsoring Organizations of the Treadway Com- 
mission (COSO) issued the Enterprise Risk Management Integrated Framework (see 
www.coso.org) and proposed this definition of ERM: 


Enterprise risk management is a process, effected by an entity’s board of directors, manage- 
ment, and other personnel, applied in strategy setting and across the enterprise, designed to 
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identify potential events that may affect the entity, and manage risk to be within its risk 
appetite, to provide reasonable assurance regarding the achievement of entity objectives. 


This definition makes a number of points. First, there should be board involvement in 
ERM. Second, ERM is a component of a company’s strategy. Third, it involves identifying 
potential adverse events. Fourth, as part of ERM the company must identify its risk 
appetite and manage risks in a way that is consistent with that appetite. Fifth, ERM 
should help a company achieve its objectives and be a core part of strategic planning and 
strategic execution processes. 

Many financial institutions (and their boards) have historically had a compliance 
orientation as far as top-down risk management is concerned. Their aim was to satisfy 
regulators and stay within the letter of the law. ERM is moving financial institutions 
away from this mind-set. It is leading to the risk management function having a strategic 
orientation and being value enhancing. A company that manages risk effectively has a 
competitive advantage over another company that does not do so. 

This chapter considers the essential elements of ERM. It considers risk appetite, risk 
culture, how potential adverse events are identified, and how ERM is integrated into a 
financial institution’s strategy. 


27.1 Risk Appetite 


An essential part of ERM is concerned with specifying the organization’s risk appetite. 
How much risk is the organization prepared to take in order to achieve its strategic 
objectives and business plans? One dimension of the risk appetite of a financial institution 
is likely to be concerned with the loss it is prepared to incur if a worst-case scenario 
happens. This can be expressed in terms of an enterprise-wide value at risk (VaR) or 
expected shortfall (ES) measure. 

For a fund manager, this dimension of risk appetite is relatively easy to quantify—at 
least in theory. Chapter 1 explained the theoretical trade-off between expected return 
and risk. The mix of assets influences the beta of the portfolio, and this in turn determines 
the theoretical expected return and risk.! Figure 27.1, which is similar to Figure 1.4 in 
Chapter 1, shows the trade-off between risk and expected return for a portfolio. As shown 
in Section 1.2, if the beta of the portfolio is B, the expected return pp is 


Hp = (1-B)R- + BRy 


and the standard deviation of the return is 


Op = Boy, 


lIn practice, as explained in Chapter 1, the theoretical return for a particular value of beta is often 
used as a benchmark. Fund managers who beat the benchmark are adding value. 
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Expected 
return, pp 


ad = BR; + BRy PEPER OS ete f 


Rp 


Standard deviation 
of return, op 


Figure 27.1 Dependence of Mean and Standard Deviation of a Portfolio on Beta, B 


where 6), is the standard deviation of the market return, Rẹ is the risk-free return, and 
Ry is the expected return on the market. Assume returns are normally distributed? The 
return, R, that has a probability p of being exceeded in one year is given by 


n (=) =(1-p) 


Op 


where N is the cumulative normal distribution function. Substituting for pp and Gp we 
obtain 


R—-(1—B)Rp — PRm zi 
ENT KH 
Bo, (1 —p) 


or 


bes (27.1) 
Ry — Rp +N "(1 = p)oy 


This formula allows us to determine the beta that is consistent with a particular 
VaR objective. Table 27.1 shows returns including dividends from the S&P 500 over 
the 47 years between 1970 and 2016. The mean and standard deviation of returns over 


? This assumption is made to provide a simple illustration. The returns from the market have heavier 
tails than the normal distribution, so in practice the returns would be fitted to a different distribution, 
perhaps a Student’s f-distribution with a relatively small number of degrees of freedom. 
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Table 27.1 Total Annual Return from the S&P 500 (including dividends) between 1970 and 2016 


Year Return (%) Year Return (%) Year Return (%) 
1970 4.01 1986 18.67 2002 —22.10 
1971 14.31 1987 5.25 2003 28.68 
1972 18.98 1988 16.61 2004 10.88 
1973 —14.66 1989 31.69 2005 4.91 
1974 —26.47 1990 —3.10 2006 15.79 
1975 37.20 1991 30.47 2007 5.49 
1976 23.84 1992 7.62 2008 —37.00 
1977 -7.18 1993 10.08 2009 26.46 
1978 6.56 1994 1.32 2010 15.06 
1979 18.44 1995 37.58 2011 2.11 
1980 32.50 1996 22.96 2012 16.00 
1981 —4.92 1997 33.36 2013 32.39 
1982 21.55 1998 28.58 2014 13.69 
1983 22.56 1999 21.04 2015 1.38 
1984 6.27 2000 —9.10 2016 18.86 
1985 31.73 2001 —11.89 


Table 27.2 Return Statistics from S&P 500 Data in Table 27.1 


Mean Annual Standard Deviation of 
Period Return (%) Annual Returns (%) 
2007 to 2016 8.75 17.11 
1997 to 2016 9.38 18.50 
1970 to 2016 11.74 18.86 


the final 10 years, the final 20 years, and the whole 47 years are as shown in Table 
27.2. Suppose that the risk-free rate of interest that can be earned is 2% (Rp = 0.02) 
and that the past 10 years are used to estimate R,, and 6), so that Ry, = 0.0875 and 
Om = 0.1711. Suppose further that a fund’s risk appetite is such that it wants to have a 
95% probability of not losing more than 10% in one year. Substituting R = —0.1 and 
p = 0.95 into equation (27.1), the fund manager’s beta should be 


—0.1 — 0.02 


2 ———— © 56 
0.0875 — 0.02 + N-!(0.05) x 0.1711 


p 


This means that to achieve the VaR objective the risk manager should invest a little 
more than half a portfolio in the market and the rest a risk-free asset? 


3In practice, a fund manager is likely to want to incorporate his or her judgment about returns into 
the choice of a portfolio. A way this can be done is provided by F Black and R. Litterman, “Asset 
Allocation: Combining Investor Views with Market Equilibrium,” Journal of Fixed Income 1, no. 2 
(September 1991): 7-18. 
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Unfortunately, the available risk-return alternatives for a financial institution such 
as a bank or an insurance company cannot usually be quantified as easily as for a fund 
manager. When markets are efficient, we can use the arguments in Chapter 1 to show that 
for stock market and similar investments an efficient frontier similar to that in Figure 27.1 
should exist. But there is no reason why this efficient frontier should define trade-offs 
between risk and expected return for major strategic investments undertaken by a bank, 
insurance company, or other type of financial institution. Some strategic investments will 
be above the efficient frontier and some will be well below. 

For any company, strategic investments that are above the efficient frontier are usually 
those that take advantage of the company’s competitive advantages. Companies, whether 
financial or non-financial, should therefore define their competitive advantages and look 
for strategic investments that exploit those advantages. There is some evidence that they 
have not always been very successful at doing this. A well-known study by Bowman 
uncovered a negative relationship between risk and return for companies in most sectors. 
This is sometimes termed the Bowman paradox. In a further study, Bowman showed that 
more troubled firms often take less justifiable risks? One explanation for this might be 
that a company experiencing difficulties often has few competitive advantages. It may 
be tempted to undertake a major project that has only a small probability of success if 
success would ensure survival. 

The market in which financial institutions operate is highly competitive, and com- 
petitive advantages are not as common as they are in some other industries. Typically, 
attempts by a financial institution to increase returns involve an increase in systematic 
(beta) risk, and the financial institution’s shareholders can often achieve an equivalent (and 
sometimes better) risk-return trade-off for themselves. The main competitive advantage 
of a financial institution is the trust of its customer base (retail and commercial) that 
it has built up. Providing services to that customer base should give a satisfactory risk- 
return trade-off. For regulated financial institutions, the risk-return trade-off is impacted 
by increasingly stringent capital and liquidity requirements. 

As mentioned, the part of a financial institution’s risk appetite concerned with the 
worst-case loss, or equivalently the worst-case erosion of equity capital, can be expressed 
by specifying VaR or expected shortfall with a particular time horizon and confidence 
level. Other parts of the risk appetite are less easy to quantify. They are likely to be 
concerned with reputational risk, credit rating risk, the risk of not complying with regu- 
lations, legal risks, and so on. Some of these are likely to be expressed qualitatively rather 
than quantitatively. For example, a bank might state that any project that could apprecia- 
bly reduce its reputation among its core customers, and how trustworthy it is perceived 
to be by those customers, is unacceptable. 

Business Snapshot 27.1 describes how Wells Fargo put employees under a huge 
amount of pressure to meet unrealistic sales targets in order to increase sales. This strategy 


* Bowman, E. H., “A Risk-Return Paradox for Strategic Management,” Sloan Management Review 21 
(1980): 17-31. 
> Bowman, E. H., “Risk Seeking by Troubled Firms,” Sloan Management Review 23 (1982): 33—42. 
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BUSINESS SNAPSHOT 27.1 
Cross-Selling at Wells Fargo 


Wells Fargo is a bank that survived the 2007—2008 crisis well. It avoided the risks 
taken by other banks during the period leading up to the crisis and gained a 
reputation for stability. Unfortunately, subsequent events tarnished its reputation 
and led to a loss of trust from potential clients. 

Cross-selling involves attempting to sell multiple products to clients. A client 
with a checking account might be encouraged to take out a credit card or open 
a savings account. Wells Fargo was considered to be one of the best banks at 
cross-selling, and indeed there is nothing wrong with cross-selling. But if it is 
carried out too aggressively it can be bad for a bank’s reputation and even be 
fraudulent. 

Wells Fargo considered branch employees to be “salespeople” and clients 
to be “customers.” Branch personnel were given very aggressive quotas for the 
number of products they were expected to sell to clients, and they were under a 
huge amount of pressure. As a result, they ordered credit cards for clients without 
the clients’ consent. They also created checking and savings accounts for clients 
who had not asked for them. In many cases, these activities led to clients paying 
additional fees to the bank. Quotas were sometimes even satisfied by enrolling 
homeless people in fee-accruing products, even though they would almost cer- 
tainly fail to make payments as they became due. 

Wells Fargo’s dysfunctional sales culture was first documented by the Wall 
Street Journal in 2011 and further described in an article in the Los Angeles Times 
in 2013. Some reforms were then made, but the bank was fined $185 million 
in September 2016 for creating more than 1.5 million unauthorized deposit 
accounts and over 0.5 million unauthorized credit-card accounts between 2011 
and 2016. Later estimates in May 2017 suggested that the number of unautho- 
rized accounts was higher than this, closer to 3.5 million in total. 

Senior executives claimed that they were unaware of what was going on as a 
result of the company’s pressure-cooker sales culture. This seems unlikely given 
that employees made calls to the company’s ethics hotline and complaints from 
customers had to be dealt with. The bank fired 5,300 employees between 2011 
and 2016 as a result of the fraudulent sales practices. The CEO, John Stumpf, 
agreed to forgo $41 million in stock options and resigned in October 2016. Later, 
$28 million of his earnings were clawed back. Other executives also lost earnings. 
The bank paid $110 million to clients who had accounts opened without their 
permission. Investigations were commenced by the Senate Banking Commit- 
tee, the Securities and Exchange Commission, and prosecutors in a number of 
jurisdictions. 
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Clearly, the aggressive sales culture at Wells Fargo was misguided. It alienated 
employees, many of whom were under unreasonable stress and found it difficult 
to find new jobs when they were fired. It also destroyed the trust and good 
reputation that Wells Fargo had built up with its clients over many years. If Wells 
Fargo’s risk appetite had included a desire to avoid doing activities that could 
adversely affect its reputation, it would surely have reduced the sales pressure and 
emphasized the importance of developing good long-term relationships with 
clients. 

Wells Fargo took steps to change its sales culture under a new CEO in late 
2016. It stopped the practice of giving employees sales quotas. Some employees 
who had been fired were rehired. It will take time for the bank’s relationships 
with employees and clients to recover, but Wells Fargo is a financially strong bank 
and we can expect that it will eventually bounce back. 


led to employees engaging in questionable and sometimes fraudulent activities in order 
to meet the targets. The strategy should have been rejected (or at least monitored more 
carefully) because of its potential adverse effect on how customers perceived the bank 
and because of its adverse effect on the bank’s relationship with its employees. 

The quantification of risk has been a major theme running through this book. Up 
to now we have implicitly tended to take a bottom-up approach. We have looked at 
different ways risk measures can be calculated by business units and then considered how 
the risk measures can be combined to produce a risk measure for the whole enterprise. 
In discussing economic capital, for example, we considered how the different types of 
risk taken by different business units can be aggregated to produce an overall economic 
capital measure. The specification of a risk appetite framework is a top-down activity 
that should involve the board of directors. Risk appetite is first defined for the institution 
as a whole. It is then used to define risk limits for business lines. This can involve some 
experimentation where bottom-up calculations are used to see if the risk appetite of the 
organization is being met. For example, a financial institution wanting to preserve its AA 
rating might decide that the probability of losses exceeding $50 billion in one year must 
be 0.03% or less. It then needs to determine by trial and error what the VaR limits for its 
trading activities, its lending activities, and so on must be so that when aggregated they 
are consistent with this aspect of the risk appetite. Once risk limits have been set, it is 
important to monitor the decisions taken in the business units to ensure that the limits 
are being adhered to. 

One important element of risk appetite is risk concentration. With the benefit of 
hindsight we can say that some large banks had portfolios that were too concentrated 
in subprime mortgages in 2006 and 2007. The risk appetite statement should directly 
or indirectly have implications for concentration risk. It should limit credit exposure to 
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any one company, limit credit exposure to any one sector of the economy, limit credit 
exposure to movements in any one market variable, and so on. 

Not surprisingly, regulators are interested in encouraging banks to develop risk 
appetite frameworks. Indeed, bank supervisors are required to ensure that effective risk 
appetite statements have been developed and implemented. In 2013, the Financial Sta- 
bility Board published a document that contained definitions of relevant terminology 
and the principles on which a risk appetite framework should be based.© The document 
spells out the responsibilities of the board of directors, the chief executive officer, the 
chief risk officer, the chief financial officer, and business line managers. 


27.2 Risk Culture 


A financial institution’s risk culture is all about how decisions are made. If the risk culture 
is good, all decisions are made in a disciplined way with a careful consideration of possible 
outcomes, weighing risks and rewards. It should be emphasized that this does not mean 
that no risks are taken. It means that risks are considered in relation to potential rewards 
and the risk appetite of the organization. The Financial Stability Board has suggested the 
way bank supervisors can assess risk culture at large systemically important banks.’ 

Decisions often involve trade-offs between short-term profits and medium- to long- 
term risks. A decision that is certain to lead to an immediate profit sometimes has the 
disadvantage that it is liable to give rise to serious problems later. Consider the situation of 
a salesman who works for a financial institution and is deciding whether to sell a sophis- 
ticated financial product to a customer when he knows it does not meet the customer’s 
true needs. The sale would be profitable to the financial institution and would increase 
the year-end bonuses of the salesman and the group that developed the product. But 
it might have adverse future consequences. For example, the reputation of the financial 
institution could suffer if the customer incurs losses as a result of the product. Expensive 
lawsuits might follow. 

The product described in Business Snapshot 5.4, which was sold by Bankers Trust to 
Procter & Gamble, is a classic example of what can go wrong. The product had a fairly 
high probability of saving Procter & Gamble 75 basis points on its funding and a small 
probability of costing the company a huge amount of money. The scenario where there 
is a huge cost to Procter & Gamble may or may not have been fully explained by the 
Bankers Trust salesman. But, if there had been a good risk culture within Bankers Trust, 
management would have questioned whether it made any sense for it to sell the product 


é See Financial Stability Board, “FSB Principles for an Effective Risk Appetite Framework,” Consul- 
tative Document, July 2013. 

7See Financial Stability Board, “Guidance on Supervisory Interaction with Financial Institutions on 
Risk Culture: A Framework for Assessing Risk Culture,” April 2014. 
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at all. Indeed, the quants working for Bankers Trust would have been told not to develop 
the product in the first place. 

Procter & Gamble was one of many companies that entered into transactions sim- 
ilar to the one described in Business Snapshot 5.4 with Bankers Trust. As it happens, 
the low-probability outcome materialized and the transactions proved to be very costly 
for Bankers Trust’s clients. There were numerous lawsuits, all of which were settled 
out of court. Bankers Trust, a company that had built an enviable reputation in finan- 
cial markets, was ruined and later acquired by Deutsche Bank. Of course things could 
have worked out well and Bankers Trust’s clients could have been grateful to the bank 
for saving them 75 basis points on their funding. But the risk to Bankers Trust and 
its reputation of huge client losses was clearly too great. The risk culture was one 
where employees thought only about short-term profits and not about the longer-term 
repercussions. 

A more recent example is provided by the transaction between the Spanish bank San- 
tander and the Portuguese state-backed rail operator Metro do Porto in 2007, described 
in Business Snapshot 27.2. The transaction allowed the rail operator to reduce its funding 
costs for the first two years, but this was at the risk of very high funding costs for later 
years. From Metro do Porto’s perspective, the trade-off being made between short-term 
gain and long-term risk was highly questionable. But the same was true of Santander. 
This was one of many similar products sold by Santander to Portuguese state-owned 
entities. No doubt the products were viewed as profitable by Santander at the time it 
entered into them, but, as later settlements and lawsuits show, subsequent costs and the 
negative impact on Santander’s reputation have the potential to outweigh the short-term 
profits. 

Yet another example of the conflicts that can exist between short-term profits and 
long-term risks is provided by Business Snapshot 27.3. This describes a deal, ABACUS, 
that Goldman Sachs sold to one of its clients without, it is alleged, fully explaining the 
risks and how it had been created. This is another example of short-term profit being 
booked without regard for longer-term reputational risk and costs. 

The ABACUS deal and one or two other events have led Goldman Sachs to embark 
on a program to change its corporate culture. Everyone at the top agreed that change 
was necessary if Goldman was to win back the allegiance of its corporate and investment 
clients. The CEO, Lloyd Blankfein, traveled to 23 countries during an 18-month period 
during which he stressed that ethics and reputation were as important as making money 
(i.e., short-term profits should not be taken if there are potential adverse long-term impli- 
cations). Gary Cohn was appointed head of the Firmwide Client and Business Standards 
Committee. One thing this committee did was assess “whether our clients had the back- 
ground experience and capacity to understand the range of outcomes from transactions 
they execute with us.” The Firmwide Client and Business Standards Committee took 
steps to ensure that Goldman’s bankers and traders were interfacing honestly with clients 
and avoiding any questionable activities. Consensus is that the reputation of Goldman 
Sachs has improved as a result of the changes. 
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BUSINESS SNAPSHOT 27.2 
The Santander Rail Deal 


In 2007, the Portuguese rail company Metro do Porto (MdP) was looking for 
a way to reduce its 4.76% per annum funding costs. After consulting a number 
of banks, it agreed to a 14-year swap with the Spanish bank Santander. In this 
swap, Santander paid the 4.76% funding cost and in return MdP paid 1.76% 
plus a spread. For the first two years, the spread was zero, saving MdP 3% on its 
funding. After that, the spread was calculated each quarter as 


max[0%, Previous Spread + 2 x max(2% — R, 0) + 2 x max(R — 6%, 0) — D] 


where R is the three-month Euribor rate and D, referred to as the DigiCoupon, 
is 0.5% if R lies between 2% and 6%, and zero otherwise. 

At the start of the deal, three-month Euribor was about 4%. If it had stayed 
between 2% and 6% for the whole 14 years, MdP would have saved 3% per 
annum on its funding for the whole period. As it happened, three-month Euribor 
had dropped below 2% by 2009 when the deal was only two years old, and it 
stayed below 2% for several years. As a result, the spread increased rapidly, reaching 
40.6% in September 2013. The problem for MdP was that the spread in one 
quarter is dependent on the spread in the previous quarter. Suppose that three- 
month Euribor is 0.5% from year 2 onward. MdP’s spread would increase by 3% 
per quarter or 12% per year. If subsequently three-month Euribor increases so 
that it is in the 2% to 6% range, the spread declines, but only at the rate of 0.5% 
per quarter or 2% per year. 

The deal was subject to a lawsuit. (Disputes over other similar deals entered 
into by Santander with other Portuguese state-owned entities were settled out 
of court.) MdP should of course have run scenario analyses to understand the 
potential costs of the transaction. Whether it did this and whether it under- 
stood the cumulative nature of the spread adjustments is not known. Possibly, 
it overweighted the attractiveness of saving 3% per year for the first two years. 
According to an article in Risk magazine in 2014, one London-based corporate 
hedging adviser has described this deal as a contender for the worst trade of all 
time! 


In financial institutions, compensation plays a key role in determining the trade-offs 
between short-term profits and long-term risks. Traditionally, bonuses have been paid 
once a year to employees based on their performance during the year. This meant that 
the time horizon for decision making was usually the next bonus date (always less than 
a year). If a deal could lead to adverse consequences but not before the year end, the 
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BUSINESS SNAPSHOT 27.3 
ABACUS 


In 2007, Goldman Sachs created a product called ABACUS from a synthetic 
ABS CDO. As explained in Chapter 6,an ABS CDO is formed from a portfolio 
of tranches (often mezzanine tranches) of ABSs. The word synthetic means that 
Goldman defined the ABS CDO, but it did not actually buy the underlying 
tranches. It then acted as an intermediary between two parties. Party A bought 
a credit default swap providing protection against what the losses on the senior 
tranche of the ABS CDO would have been if it had been created. Party B sold 
the credit default swap, agreeing to provide the protection. Party A paid Party B 
an insurance premium (the CDS spread). 

So far, so good. This is what investment banks such as Goldman Sachs do all 
the time. They act as intermediaries between buyers and sellers. In this particular 
case, Party A was a hedge fund controlled by John Paulson. This was one of the 
few hedge funds to make big bets against the performance of subprime mort- 
gages and the housing market generally. It transpired that Paulson paid Goldman 
Sachs $15 million for creating the structure and that his hedge fund had a hand 
in choosing the ABSs that went into the structure. Party B was IKB Deutsche 
Industriebank AG (IKB), a bank based in Dusseldorf, Germany, that specializes in 
lending to small and medium-size companies. The portfolio manager was ACA, 
which at the time had a good reputation for managing ABS CDOs. 

It is alleged that Goldman Sachs represented to IKB that the ABS CDO 
had been defined by ACA when the ABSs in the portfolio had in fact been 
chosen by John Paulson’s hedge fund as ones that were likely to incur huge 
losses. Specifically, it is alleged that Paulson constructed the synthetic ABS CDO 
with tranches formed from portfolios of mortgages where the borrowers had 
low FICO scores and came from states where the rate of house appreciation had 
been the highest. 

Both IKB and ACA took huge losses on ABACUS, and Paulson’s hedge fund 
made a huge profit. (IKB had to be bailed out because of the massive losses it 
sustained from the subprime meltdown.) The deal was investigated by the Secu- 
rities and Exchange Commission (SEC), and Goldman Sachs ended up paying a 
fine of $550 million, the largest in the history of the SEC at that time. Goldman 
Sachs’s position was hurt when it was revealed that an executive working on 
the deal, Fabrice Tourre, had sent the following e-mail to a friend: “More and 
more leverage in the system. The whole building is about to collapse anytime 
now....Only potential survivor, the fabulous Fab... standing in the middle of 
all these complex, highly leveraged, exotic trades he created without necessarily 
understanding all the implications of those monstrosities!!!” 
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possible adverse consequences were likely to be ignored. There has been an attempt in 
many financial institutions to change compensation arrangements to address this short- 
term focus. The payment of the bonus for a year is now often spread out over several 
subsequent years, and bonuses can be clawed back (i.e., not paid) if subsequent events 
indicate that the year was not as profitable as thought5 


27.3 Identifying Major Risks 


An important part of ERM is identifying key exposures for both a financial institution’s 
existing business and any proposed strategic investments. The identification of these 
exposures is in many ways similar to the choice of scenarios for stress testing. History 
can provide some guidelines. For example, a financial institution should consider how it 
would fare if a recession as bad as that experienced during 2007 to 2009 were to occur. 
The impact of current and emerging trends should also be carefully analyzed. Economists 
and senior management working for the financial institution can provide useful insights. 
As explained in earlier chapters, the environment is changing for banks. One of the 
results of the credit crisis that started in 2007 is that banks are becoming more tightly 
regulated. Some activities such as the trading of derivatives are less profitable than before, 
and other activities such as proprietary trading can (at least in the United States) no 
longer be undertaken at all. Capital requirements and liquidity requirements are higher. 
Some financial intermediation business is moving from banks to shadow banks such as 
money market funds, mortgage companies, securitization vehicles, and so on. In many 
cases, shadow banks are less heavily regulated than banks and, as a result, can offer more 
competitive services. 


27.3.1 Cognitive Biases 


Our ability to identify risks effectively is affected by what are termed cognitive biases. 
These describe the tendencies for human beings to think in certain ways and be less than 
perfectly rational. One common cognitive bias is wishful thinking. It is sometimes difficult 
to distinguish between what we want to happen (e.g., a project to be a success) and what 
we think will happen. (Try asking a Manchester United supporter to estimate the chance 
of Manchester United winning the FA cup next year!) When we want something to 
happen, we are liable to think only of reasons why it will happen. 

Over 100 cognitive biases have been listed by psychologists. Much of the pioneering 
work was done by Daniel Kahneman and Amos Tversky. Kahneman won the Nobel 


8 Legal issues can make it difficult for financial institutions to try to recover bonuses that have already 
been paid. Hence, clawback schemes have to be combined with bonus deferral. 

? See, for example, D. Kahneman, P. Slovic, and A. Tversky, Judgment Under Uncertainty: Heuristics and 
Biases (New York: Cambridge University Press, 1982). 
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prize for economics in 2002 for his work with Tversky on prospect theory, which is 
concerned with the way people choose between risky alternatives. (Tversky had died a 
few years earlier.) 

One important bias is anchoring. When evaluating a potential outcome (e.g., the rev- 
enue resulting from a major new venture), we are liable to anchor onto the first estimate 
that is made. We tend to make relatively small adjustments to that estimate (this is referred 
to as “anchoring and adjustment”) and often never consider the full range of possible 
outcomes. In particular, important adverse outcomes may implicitly be considered to 
have no chance of occurring. To illustrate anchoring, one could ask a group of people to 
make a best guess of something that is unknown to them such as, perhaps the population 
of Iceland. They can then be asked to provide a range consisting of the 5th percentile to 
95th percentile of their subjective probability distribution. If their estimates are good, the 
true population of Iceland should lie outside the range only 10% of the time. In practice, 
it is found that this happens much more frequently. Anchoring causes people to behave 
as though they know more than they do. 

Another cognitive bias is availability. This is where recent information is given undue 
weight. Sadly, enterprise risk management—and risk management generally—can suffer 
from availability. Prior to the credit crisis, risk managers in some financial institutions 
were often not listened to because recent experience had been good. After the crisis, risk 
managers have had more influence, but as memories of the crisis fade, the “good times 
will last forever” attitude may return. 

Another cognitive bias is known as representativeness. This is where an individual cate- 
gorizes a situation based on a pattern of previous experiences or beliefs about the under- 
lying scenario. It can be useful when trying to make a quick decision, but it can also be 
limiting because it leads to close-mindedness and stereotyping. Based on previous expe- 
rience, a senior manager at a financial institution might consider it almost impossible 
for any other financial institution to successfully compete with it in a particular market. 
However, if the manager’s past experience is limited, the previous situations might not 
be representative of future scenarios. 

A more subtle mistake sometimes made in estimating probabilities is inverting the 
conditionality. Suppose that 1 in 10,000 people have a particular disease. A test that is 
99% accurate gives you a positive result (suggesting that you have the disease). What is the 
chance that you have the disease? Your immediate response is likely to be 99%. However, 
the true answer is actually about 1%! We are interested in 


Prob(D|TP) 


where D indicates that you have the disease and TP indicates a positive result from the 
test. 
It is true that 


Prob(TP|D) = 0.99 
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but 
Prob(D|TP) 4 Prob(TP|D) 


Out of 10,000 people there will be about 100 positive results on average but only one 
person with the disease.!° Hence the probability we are interested in is about 0.01. This 
is an application of the result in probability theory known as Bayes’ theorem. 

Yet another bias is the sunk costs bias. Suppose that a financial institution has already 
spent $1 billion trying to enter a new market. Things are not going well and there 
seems to be very little prospect of success. Should the $1 billion influence the financial 
institution’s decision making? The answer is the $1 billion is what accountants refer to 
as a sunk cost. Regardless of the decisions taken now, it cannot be recovered. The key 
issue is whether future profits will be sufficiently high to justify future expenditures. In 
practice, many people are reluctant to admit mistakes, and they continue with projects 
that are clearly failures for too long. Irrationally, they want to try to get back money 
already spent, even when the chance of this is very small. 

Understanding these biases may assist decision making and the identification of key 
risks. It should be noted, however, that experiments have shown that it is extremely 
difficult to eliminate biases. Even when biases such as anchoring are carefully explained 
and subjects are given financial incentives to make good estimates, the biases persist. 

The challenge for ERM is identifying tail risks and trying to estimate the probabilities 
associated with the adverse scenarios giving rise to the tail risks as well as possible. The 
cognitive biases we have discussed (and many others have been documented) suggest that 
the risks will be underestimated. Nassim Taleb makes this point in a best-selling book."! 
He is particularly critical of the use of normal distributions for calculating risk measures 
(as we have been in this book) and argues that extreme events such as the crash of 1987 
or the credit crisis of 2007 to 2009 are more likely than many people think. 

Using committees of senior management and economists to define adverse scenarios 
can be useful. Encouraging employees to play the role of devil’s advocate when ideas are 
discussed can also be important. Risks are obviously less likely to be considered if there 
is an authoritarian CEO who is convinced that he or she is right on major strategic 
decisions and does not encourage contrary views to be expressed. 


27.4 Strategic Risk Management 


Once a company’s strategy has been defined, strategic risk management involves consid- 
ering what assumptions have been made and what can go wrong. What will competitors 


'0Tf there are 100 positive results, the test is correct for 9,901 people out of 10,000 so that the accuracy 
is about 99%. 

"See N. N. Taleb, The Black Swan: The Impact of the Highly Improbable (New York: 
Random House, 2007). 
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do? How will customers react? How could the strategy be impacted by adverse external 
events? To quote Michael Porter:!? “Risk is a function of how poorly a strategy will 
perform if the ‘wrong’ scenario occurs.” 

Once a particular risk has been identified, either in the existing business or in a new 
strategic investment, there are a number of alternative courses of action: 


Avoid: Exit the business/investment. 

Reduce: Take action to reduce the probability of the adverse event or its impact. 
Adjust: Modify plans to reduce risks. 

Share or insure: Transfer or share a portion of the risk. 

Accept: Take no action. 


Which of these actions is taken is likely to depend on the situation, but it is important 
that the risk is identified and carefully considered, rather than being ignored. 

Suppose that a successful Canadian bank is considering an expansion of its retail 
operations by buying a smaller bank in the United States. It has prepared plans indicating 
that the expansion will increase shareholder value and provide a springboard for increas- 
ing its customer base in the United States. There are a number of risks that it needs to 
consider. Will it be able to transfer its success in Canada to a more competitive banking 
environment? Will it be able to retain the customers of the bank it is acquiring in the 
United States? Will it be able to retain key managers? What will happen if there is a 
severe two-year recession soon after the acquisition is completed? The Canadian bank 
needs to make sure the acquisition is consistent with its risk appetite. The risk appetite 
might require it to ensure that if a worst-case scenario happens it will survive without its 
core Canadian business being appreciably affected. It needs to ensure that the risks can 
be managed and monitored and that if things work out badly there is a good exit strategy. 


Summary 


Enterprise risk management is an attempt to take a holistic approach to risk management 
rather than addressing risk in silos. The central idea is that business units within the 
financial institution should be part of an integrated, strategic, and enterprise-wide risk 
management system. The risk appetite for the organization should be defined at the top 
and approved by the board. Steps should then be taken to ensure that the management 
of different types of risks in different business units is consistent with the overall risk 
appetite framework. 

A financial institution’s risk appetite statement should define how much risk the 
organization is prepared to take in pursuit of its objectives. Some aspects of the risk 
appetite, such as the amount of capital the financial institution is prepared to risk los- 
ing, can be expressed quantitatively. Others, such as risks to reputation, are likely to be 


12See M. E. Porter, Competitive Advantage (New York: Free Press, 1985), 476. 
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expressed qualitatively. The risk appetite must then be converted into directives for the 
risks taken by business units. Appropriate limits are then set on trading risk, how con- 
centrated the financial institution’s portfolio can become, how much credit risk can be 
taken, and so on. The risks being taken should be continually monitored to ensure that 
they are consistent with the financial institution’s risk appetite. 

Developing a good risk culture is an important part of ERM. A key element of the 
risk culture is ensuring that medium- to long-term risks are considered when opportu- 
nities that offer short-term profits are evaluated. There are many examples of financial 
institutions aggressively pursuing short-term profits when the activity being undertaken 
has led several years later to serious problems such as lawsuits, fines, or a loss of reputation. 
The bonus structure within financial institutions has in the past encouraged employees 
to use the maximization of short-term profits as their sole criterion for decision making. 
Since the 2008 credit crisis, there have been attempts to change this culture by introduc- 
ing deferred bonuses and clawbacks. 

It is important for a financial institution to develop procedures for identifying worst- 
case outcomes. Unfortunately, there are many biases that are liable to cause managers 
to misjudge future outcomes and their probabilities of occurring. Once the risks being 
taken have been identified, it is important to actively manage the risks. Sometimes it will 
be necessary to discontinue certain activities; sometimes risks can be shared (e.g., with 
insurance or joint ventures); sometimes steps can be taken to reduce the impact of the 
risks; and sometimes risks can be accommodated within the organization’s risk appetite. 
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Practice Questions and Problems 
(Answers at End of Book) 


27.1 Explain the difference between top-down and bottom-up approaches to risk man- 
agement. Why are both necessary in ERM? 

27.2 What is the Bowman paradox? 

27.3 “The risk culture at a bank should lead it to consider the longer-term implications 
of decisions as well as short-term profits.” Discuss. 

27.4 What steps can a bank take to encourage employees to consider more than short- 
term profits? 
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27.5 


27.6 


27.7 


27.8 


27.9 


27.10 


What distinguishes enterprise risk management from more traditional approaches 
to risk management? 

A fund’s risk appetite is such that it wants to be 90% certain that it will not lose 
more than 20% in any one year. Using the performance of the S&P 500 between 
1970 and 2016 (see Table 27.2), determine the beta the fund should have. Assume 
a risk-free rate of 3% per annum. 

Calculate the interest rate paid by MdP six years after the beginning of the deal in 
Business Snapshot 27.2 if the Euribor rate proves to be 8% from year 2 onward. 
Give three examples of cognitive biases. 


Further Questions 


Assume that in Business Snapshot 27.2 the change in the three-month Euribor 
rate in each quarter is normally distributed with mean zero and a standard devia- 
tion equal to x basis points. Use Monte Carlo simulation (500 trials) to calculate 
a probability distribution for the average interest rate paid by MdP over the four- 
year period for values of x equal to 10, 20, and 50. 

A fund’s risk appetite is such that it wants to be 97.5% certain it will not lose 
more than 25% in any one year. Using the performance of the S&P 500 between 
1997 and 2016 (see Table 27.2), determine the beta the fund should have. Assume 
a risk-free rate of 2.5% per annum. 


Chapter 28 


Financial Innovation 


n the 1980s, when booking a flight from New York to London, a customer would 

contact a travel agent, who would check availability, present options, receive instruc- 

tions, and then finally make a reservation. Today, that customer is likely to go to the 
airline’s website and make the booking directly with the airline. What has happened is 
a process known as disintermediation. The travel agent as an intermediary is no longer 
needed. 

This does not mean that there is no need at all for intermediaries in the travel busi- 
ness. Online services such as Expedia and Travelocity have sprung up to be of assistance 
when customers do not know which airline to use or want to quickly compare prices 
offered by different airlines. But the nature of the intermediaries in the travel business 
has changed, and the human involvement in the process of making straightforward flight 
arrangements has largely disappeared. The creation of new technology-based interme- 
diaries like Expedia and Travelocity is referred to as reintermediation. Disintermediation 
followed by reintermediation is a common pattern in technological change. 

Banks and other financial companies are intermediaries, like the travel agents of the 
1980s, and they are similarly in danger of having the services they provide disrupted by 
technology. A commercial bank is an intermediary that channels money from depositors 
to borrowers. It also is an intermediary in the payments system, clearing checks and trans- 
ferring funds as instructed by clients. Investment banks are intermediaries that facilitate 
the issue of stocks and bonds. Banks and other financial companies are intermediaries 
when they perform wealth management services for clients, channeling their savings to 
appropriate investment products. 
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FinTech is concerned with the application of technology to financial services. 
Many applications of technology such as ATMs, online banking, and online trading 
have already taken place. This chapter looks at how financial services are likely to be 
impacted by FinTech in the future, which activities are most likely to be digitized, how 
customers will respond, and how financial services companies can restructure themselves 
to adjust to rapid technological changes. These are important questions for today’s risk 
management professionals. 


28.1 Technological Advances 


The increase in computer processing power in the first 50 years of the computer age 
has been truly astounding, and this has made possible many societal advances. In 1965 
Gordon Moore, co-founder of Intel, correctly predicted that computer processing power 
would double every two years. A single smartphone now has more processing power than 
NASA had when it landed a man on the moon in 1969! 

Many activities formerly carried out by humans have been automated. An interest- 
ing development is robotics process automation (RPA). This is a software application that 
replicates the actions of human beings when they interact with a company’s systems. It 
is an alternative to automating by changing the systems themselves. The software robot 
is a virtual worker that can be trained to perform certain tasks in much the same way as 
a human would be trained. The tasks could for example be concerned with opening a 
new customer account or hiring a new employee. Typically, data are sent to a number of 
different systems within the organization. 


28.1.1 Machine Learning 


One important result of the increase in computing power has been the growth of machine 
learning. This is a branch of artificial intelligence that allows computers to learn without 
being explicitly programmed. 

Suppose we wish to teach a computer to translate between English and Japanese. One 
approach is to program all the grammatical rules of English and definitions contained in 
an English dictionary, do the same for Japanese, and then supply a Japanese—English dic- 
tionary. An alternative, pioneered by Google, that has proved to be much more successful 
is to use machine learning. The computer is fed many texts in English with Japanese 
translations produced by bilingual human beings. The computer then uses various tools 
to search for patterns and develop its own complex translation rules. 

Machine learning has many potential applications in finance, such as fraud recogni- 
tion and the automation of lending decisions. There are three types of machine learning: 


1. Supervised learning: The computer is presented with data consisting of inputs and 
desired outputs, and the goal is to develop rules for mapping inputs to outputs. The 
translation example we have just given falls into this category. 

2. Unsupervised learning: The computer is asked to learn by finding structure or hidden 
patterns in data. 
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3. Reinforcement learning: A computer program interacts with a dynamic environment in 
which it must perform a certain goal, such as driving a vehicle. 


There are many different statistical and other tools used in machine learning to reveal 
patterns in data. Supervised learning can involve linear regression and logistic regression. 
Unsupervised learning can involve clustering techniques such as the k-means algorithm. 
Sometimes dimensionality reduction algorithms such as principal components analysis 
are used before processing data. 

Human beings are very good at some pattern recognition tasks that computers have 
traditionally found difficult, for example, distinguishing one person’s face from another. 
A study of how we do this has led to some important breakthroughs in machine learning. 
Neural networks are tools designed to mimic the way humans recognize patterns. They rely 
on a mathematical procedure known as backpropagation to iteratively calculate weights 
on nodes to produce a desired output. 

A large amount of data is necessary for a machine learning algorithm to train on. 
Without parallel processing, cloud computing, and other advancements in hardware and 
software, today’s applications would not be possible. 


28.1.2 Blockchain 


Often ledgers recording the ownership of assets involve the possibility of fraud, lack of 
trust by users, and computer hacking. Many readers will have heard stories about title 
fraud where someone who is not the owner of a house manages to claim ownership 
and obtain a mortgage. Sometimes ledgers recording the same items are kept by different 
people from different organizations. There may then be inconsistencies in the records, 
and it can be expensive and time-consuming to sort these out. 

Blockchain is a distributed ledger where many people can add entries and a commu- 
nity of users control how it is updated. Problems involving inconsistent records, fraud, or 
hacking are either impossible or extremely unlikely. Identical records are on many differ- 
ent computers and can be inspected at any given time. A blockchain was first developed 
in 2009 in connection with bitcoin, but many other applications of the technology are 
now being explored. The ledger defining who owns what is a long list of blocks. When 
a new block of transactions is created (every 10 minutes in the case of bitcoin), a new 
block is added to the chain. A constantly updated list of blocks is given to everyone who 
participates. 

An important aspect of blockchain is what is known as hashing. A hash is a string of 
64 characters. Any text can be converted into a hash, but the process cannot be reversed 
so that the hash is converted back into text. Even a small change in the text will totally 
change the hash. To illustrate this, we can go to the SHA 256 hash calculator at: 


http://www.xorbin.com/tools/sha256-hash-calculator 


Hashing the title of this book, Risk Management and Financial Institutions (without italics), 
gives 


1dcc48387a27cd95378b08ab26261b161a97c51a7c9 146f3d3fF73710d656a3f 
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Adding the edition number so that Risk Management and Financial Institutions 5 is input 
(again without italics), produces a totally different hash: 


117e1e23121f8db4b75d3e2a63d37ef052b1 1a63cf448721825aadb882492c6b 


It is theoretically possible for two sets of records to give the same hash, but in practice 
this is to all intents and purposes impossible. The hash we have just looked at is formed 
from the numerals 0 to 9 and the first six letters of the alphabet. This means that there 
are 16° — 1 different hashes.! 

The records in each block of a blockchain are sealed and made tamper proof with a 
hash. One of the inputs to the hash is the hash of the immediately preceding block. This 
means that any attempt to tamper with a block will be immediately spotted because it 
will affect that block and all subsequent blocks. 

In order for the blockchain to be free from human intervention, participants must 
be given an incentive to check new blocks and hash them. Bitcoin does this by inviting 
participants, referred to as miners, to search for something called a nonce, which when 
added to the block will lead to a hash that starts with several zeros. The participant who 
finds the nonce first is rewarded with a predetermined number of bitcoins. The new 
block is then hashed and distributed to the computers of all participants. 

There are two sorts of blockchains: permissioned and permissionless. Permissionless 
blockchains such as bitcoin have no restrictions on participation and there is no need for 
participants to know or trust one another. Permissioned blockchains are privately shared 
systems, and the participants may know one another and may (at least to some extent) 
trust one another. Permissioned blockchains are a way financial institutions can share data 
to speed the settlement of transactions. 

Distributed ledger technologies (DLTs) such as blockchain are now receiving a great 
deal of attention by financial institutions and the big four accounting firms, as well as 
many other large businesses. In August 2016, the World Economic Forum reported six 
developments? 


. There had been $1.4 billion of venture capital investments in DLT since 2013. 

. More than 2,500 patents had been filed since 2013, many by financial institutions. 
. More than 24 countries were currently investing in DLT. 

. More than 90 central banks were engaged in DLT discussions. 

. More than 90 corporations had joined DLT consortia. 

. Some 80% of banks were predicted to initiate a DLT project by 2017. 


au &wN 


Whether blockchain and other DLT ideas live up to the hype remains to be seen. 
IBM in 2017 announced it was working with seven European banks to use blockchain 


1 This is approximately equal to the number of atoms in the known observable universe. (In our count, 
we exclude a hash consisting of 64 zeros.) 

?See World Economic Forum, The Future of Financial Infrastructure: An Ambitious Look at How Blockchain 
Can Reshape Financial Services, August 2016. 
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technology to make it easier for small and medium-size businesses to trade with one 
another. Other potential applications in finance are to the trading of securities, settlement, 
clearing, and other back-office functions. DLTs have the potential to speed up these 
processes. Indeed, almost real-time settlement of trades becomes a possibility. Costs should 
be reduced and there should be less need for human intervention. Operational risks 
should be reduced because the possibility of fraud or cyber-attacks should be lower. This 
may in turn lead to a reduction in regulatory capital. 


28.2 Payment Systems 


Technology has already had a huge effect on the way payments are made. As a society 
we have moved from cash and checks to credit and debit cards to the use of mobile 
wallets. Developing countries have in some respects progressed further in this direction 
than developed countries, in part because traditional payment systems were not as well 
established. Many FinTech start-ups are offering new services. Some, such as PayPal, 
Apple Pay, and Google Wallet in the United States, and Alipay (part of Alibaba) and 
Tencent in China, are large, well-established companies. 

The key attributes of a payments method are speed, convenience, security, simplicity, 
and cost. Many services such as PayPal transfer funds almost immediately. Storing credit- 
card-like information in an iPhone or similar device adds to the convenience for many 
consumers. Indeed, given the dominant position of Apple in the smartphone market, it 
has been natural for Apple to expand into payments, and some have speculated that it 
will not be long before Apple offers a wider range of banking services. One can imagine 
that wearables such as watches or bracelets, or even implants, may be used in the same 
way as smartphones to add to the convenience of the way payments are made. 

Security is a big issue for all forms of payment. Tens of billions of dollars are lost each 
year from credit card fraud. Embedding chips as well as magnetic strips in cards helps 
but does not eliminate this problem. We are likely to see big changes in the way fraud is 
avoided in the future. Digital wallets are considered to be more secure than credit cards. 
Many payment providers, including banks, are investigating the use of biometric autho- 
rization. (Biometrics often involves machine learning, which has already been discussed.) 
In the future, it may be the case that a purchase can be completed by waving the palm of 
one’s hand over a reader. Retinal scanning, facial recognition, voice authentication, and 
even heartbeat monitoring are all being considered. The costs of fraud are usually borne 
by the payments system provider, and they are of course passed on to merchants in the 
form of the fee charged. (PayPal, for example, charged 2.9% plus $0.30 per transaction 
in mid-2017.) The merchants in turn pass the fee on to consumers. Everyone therefore 
has an interest in reducing fraud, and the approaches we have just discussed for making 
more secure payments should be welcomed. 

If biometric authorization seems far-fetched, bear in mind that India has issued bio- 
metric IDs (involving fingerprints and retinal scans) to over one billion people and is 
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laying the groundwork for a cashless society. Making more people part of the finan- 
cial ecosystem is an important objective for development of the country. The biometric 
IDs also have the advantage that some government benefits can be distributed with less 
involvement from intermediaries. Of course, some would argue that the provision of 
biometric information is an unacceptable violation of a person’s privacy, and this may 
slow down its acceptance in developed countries. 

Some payment systems allow users to borrow money. The interest rates charged by 
credit card companies are very high, but it should be remembered that users do get free 
credit for the period of time between a purchase and the next monthly due date. (PayPal 
competes with this by offering 14 days of free credit.) It is likely that more convenient 
credit facilities, tailored to the needs of users, will be developed. Alibaba through its 
subsidiaries such as Alipay and Mybank is already offering many of the same services as 
banks. 

What other services can be offered to make payment systems attractive? Many indi- 
viduals remit funds on a regular basis to family members in another country. The foreign 
exchange services associated with those transactions are likely to get more convenient 
and more competitive. For businesses, easy-to-use foreign exchange hedging services 
that compete with those offered by banks are likely to be developed. FinTechs may carry 
out sophisticated analyses of sales to help a company understand its customers better or 
provide accounting services. 

Of course, people who prefer cash will continue to exist for some time. Some people 
have bad credit histories and do not qualify for credit cards, while others are too risk 
averse to give their credit card information to third parties. Amazon has recognized this 
and allows customers to open an account at selected retailers by depositing cash. When 
goods are purchased the account is debited. 

One aspect of the digitization of payments is that it becomes much easier to collect 
data on a person’s spending habits. This could be useful to banks when making credit 
decisions. (Knowing how a potential borrower spends money can be almost as impor- 
tant as knowing how much he or she earns.) Of course it presents huge privacy issues. 
Sometimes payment systems sell anonymized data. Often the data are analyzed using 
techniques such as machine learning to better understand consumer preferences. 


28.2.1 Cryptocurrencies 


The use of bitcoin as a means of payment has grown rapidly since it was first introduced 
in 2009. The number of bitcoin transactions per day was 60,000, 120,000, and 220,000 
in August 2014, August 2015, and August 2016, respectively. In the future, can we expect 
dollars, euros, and yen to be replaced by bitcoin or other cryptocurrencies? Many people 
are bullish on bitcoin, as indicated by the 2017 growth of its value when expressed in 
USS. dollars (see Figure 28.1). 

The advantage of using a cryptocurrency such as bitcoin as a means of payment is 
that it is outside the control of any one government and therefore its value cannot be 
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Figure 28.1 Value of Bitcoin in U.S. Dollars, January 2011 to August 2017 


debased by a government’s money supply policies? This can be particularly attractive for 
individuals living in developing countries such as Venezuela or Zimbabwe where the 
value of the domestic currency is highly volatile. Payments are secure, private, and very 
low cost. An individual can remain anonymous while using a cryptocurrency. 

The disadvantage of a cryptocurrency is that its value is more volatile than other 
fiat (traditional) currencies.* It pays no interest. It is often associated (fairly or unfairly) 
with illegal activities such as drug dealing. Jamie Dimon, CEO of JPMorgan, said in 
September 2017, “Bitcoin is a fraud that will ultimately blow up.” China introduced 
regulations concerning cryptocurrencies in 2017 and other governments may do so as 
well. Users of cryptocurrencies are reliant on the way a cryptocurrency has been set up. If 
a flaw were found, the currency would be worthless. The bitcoin protocol has not been 
successfully hacked, but bitcoin exchanges have been. (See Business Snapshot 28.1.) 

There are now many cryptocurrency alternatives to bitcoin. Examples are lite- 
coin, ripple, dogecoin, nxt, monero, ethereum, and zcash. Some have procedures for 
making transactions secure that are quite different from those used by bitcoin. Some 
cryptocurrencies such as ethereum facilitate smart contracts. These are contracts that 
execute automatically when all conditions are met, thereby eliminating the need for 
intermediaries. 


3 The increase in the supply of bitcoin is determined by the amounts paid to miners (participants) 
when they verify transactions, as described in Section 28.1.2. 

‘The volatility of the U.S. dollar value of bitcoin calculated from the data between August 2006 and 
August 2007 is about 57% per year. The volatility of a fiat currency such as the euro, British pound, 
or Japanese yen when expressed in terms of the U.S. dollar is typically less than 20% per year. 
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BUSINESS SNAPSHOT 28.1 
Mt. Gox 


Bitcoin is a secure means of payment, but some bitcoin exchanges, which allow 
individuals to trade bitcoins for conventional (fiat) money, have had problems. 
There have been a number of reports of exchanges being hacked. The most high- 
profile one concerns Mt. Gox, which is based in the Shibuya district of Tokyo. 
In February 2014, it announced that bitcoins worth more than $450 million 
were missing and likely stolen. The reason for the loss, which happened over a 
number of years, is unclear. It appears that bad management and poorly controlled 
computer code allowed hackers to steal bitcoins. Some have blamed the CEO for 
embezzling the funds, but this has been denied. Whatever the reason, Mt. Gox 
filed for bankruptcy in April 2014. This damaged the image of cryptocurrencies, 
but, as Figure 28.1 indicates, the value of bitcoin recovered and then improved 
dramatically in 2017. 

How can you avoid these types of losses? Bitcoins should not be stored on 
an exchange. They should be transferred to a trustworthy online exchange only 
when you want to trade them. 


28.2.2 Central Banks 


Central banks are researching the use of cryptocurrencies. There may be an element 
of “if you can’t beat them, join them” about this. But a digital currency issued by a 
central bank could make it easier for the central bank to control the money supply. In 
the current system, banks create money. (A bank lends money to an entity; that entity 
puts the money on deposit with a bank or uses it to buy something from another entity 
that puts it on deposit with a bank; the bank receiving the funds relends it; and so on.) 
In a digital currency regime, the total amount of currency could be fixed unless the 
central bank makes an explicit decision to increase it. There could also be less need 
to guarantee bank deposits through programs like those offered by the Federal Deposit 
Insurance Corporation. The payment systems in countries such as the United States that 
have many small banks could be simplified. Another advantage, particularly in developing 
countries, could be financial inclusion. Individuals who, for whatever reason, do not have 
bank accounts could become part of the financial ecosystem. 

One can imagine a country where all citizens have a digital currency account at 
the central bank and private companies compete to provide payment services. The cen- 
tral bank can increase the money supply by adding to the holdings of all citizens in an 
equitable way. (It might do this to increase aggregate demand.) 

Some of the implications ofall this are of course uncertain. Replacing banknotes with 
digital currencies would make it much easier for a resident of one country to transact in 
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the digital currency of another country. This could lead to a new form of competition 
between the currencies of different countries and could actually end up making it more 
difficult for any one country to manage its economy by controlling the money supply. 


28.3 Lending 


As all students of banking know, there are a number of Cs associated with lending: char- 
acter, capacity, capital, collateral, and so on. Can technology automate lending decisions? 
Some banks think it can. Loan officers are being replaced by systems involving machine 
learning in some large banks. Given enough data about a bank’s lending experience, it 
is considered that a machine learning algorithm can sort good loans from bad loans as 
well as or better than a human being can. In principle, a machine learning program can 
be more objective and exhibit less bias than a human being. 


28.3.1 P2P Lending 


A bank’s net interest margin is determined by the difference between the interest it pays 
on liabilities and the interest it earns on assets. If it obtains deposits on which it is paying 
2% per annum and then lends the funds out at 5% per annum, its net interest margin is 
3%. Some of this (perhaps as much as 1%) is to compensate for expected loan losses, but 
if the depositor could lend money directly to the borrower at, say, 4%, both sides could 
potentially be better off. Peer-to-peer (P2P) lending involves disintermediation followed 
by reintermediation. Banks are no longer intermediaries, but new intermediaries are set 
up to provide services such as: 


1. Providing online platforms for borrowers and lenders to contact each other. 

2. Verifying the borrower’s identity, bank account, employment, income, and so on. 

3. Assessing the borrower’ credit risk and, if the borrower is approved, determining the 
appropriate interest rate. 

4. Processing cash flows between borrowers and lenders. 

5. Attempting to collect payments from borrowers who are in default. 


Many borrowers who currently use P2P platforms have already been refused by 
banks, so the interest rates can be quite high compared with conventional loans (but lower 
than the rates on credit card balances and other sources of credit for moderate- to high- 
risk borrowers). Two well-known P2P lending platforms in the United States are Prosper 
and Lending Club, both based in San Francisco® The P2P lending platform assigns a 


> But it has been shown that machine learning is not completely free from bias. The data that are used 
by the machine learning algorithm may incorporate biases. The machine learning programmer may 
consciously or unconsciously incorporate biases into the way algorithms are used and trade-offs are 
made. 

é For regulatory reasons, the P2P company enters into an arrangement with a bank so that the loan is 
passed through the bank and the lender obtains a security in the loan. 
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Table 28.1 Published Interest Rates Paid by Borrowers and Returns 
Received by Lenders on Lending Club Loans, June 30, 2017 


Grade A B Cc D E F+G 


Average interest rate (%) 7.26 10.79 14.01 17.20 19.94 24.04 
Average net annual return (%) 4.86 6.29 669 632 5.59 4.52 


credit rating to a borrower in much the same way that a bank does. Lending Club, for 
example, categorizes borrowers by assigning a letter grade between A and G. The interest 
rate charged to the less creditworthy borrowers is higher than to A-grade borrowers, but 
the expected loss from defaults is also higher. Statistics published by Lending Club in 
June 2017 are summarized in Table 28.1. This shows that both interest rates and loan 
losses are higher than on most loans made by banks. However, the net annual returns that 
investors receive on average are quite attractive compared with other opportunities. 

The fees at P2P lenders can be quite high. At Lending Club, for example, the bor- 
rower pays an origination fee typically between 1% and 5% of the amount borrowed. 
The lender pays a service fee (typically about 1%) on payments received and may also 
have to pay costs associated with collections on delinquent accounts. 

It clearly makes sense for lenders to be diversified. Typically, they can either choose a 
portfolio of loans themselves or ask the P2P lending platform to construct a portfolio that 
is consistent with their risk preferences. The risk of borrowers is often idiosyncratic so that 
correlations between the performances of different loans are low, making diversification 
easier. 

Some lending platforms can be criticized because they have no skin in the game. If 
loans do not perform as well as expected, the lender bears the entire cost. A exception 
is Upstart (started by former Google employees in 2014). It has a different model from 
Lending Club and Prosper. It charges borrowers an origination fee but does not charge 
lenders a fee. It uses the origination fee to reimburse lenders if a loan defaults, giving it a 
stake in the performance of the loan. Its credit assessments, which look at the borrower’s 
educational background as well as FICO scores and other data, have proved to be quite 
accurate and it has grown fast. 

P2P lending has not been immune to scandal. The founder of Lending Club (which 
used an IPO to become a public company in 2014) had to step down in 2016 as a result 
of a governance scandal (but the company seems to have bounced back). In China, retail 
investors have lost billions of dollars in incidents where P2P platform operators have 
simply disappeared with investors’ cash. (This has led to a crackdown on the industry 
by Chinese regulators.) All financial innovations are liable to have teething troubles of 
this sort. Indeed, banks over their long history have had their fair share of scandals. The 
real question for P2P lending is whether it will make inroads into traditional bank lend- 
ing. Will P2P lending become a widely used option for financing the purchase of cars 
and houses? Will P2P between corporations become more common? Because P2P plat- 
forms are relatively new, it will be interesting to see how they perform in an economic 
downturn or when interest rates increase. 
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Some P2P platforms allow lenders to sell a loan before it has been fully repaid, but 
this depends on the willingness of another client of the P2P lending platform to take 
over the loan. There is likely to be a natural assumption that if a lender wants to sell a 
loan there is something wrong with it” However, it may just be that the lender has run 
into liquidity problems so that he or she needs cash. Creating a better secondary market 
for loans is a way the market can evolve. An idea that can be used for increasing liquidity 
is to create tranches from portfolios of loans as described in Chapter 6 and sell securities 
created from them to lenders. 


28.3.2 Crowdfunding 


Crowdfunding is another activity that has the potential to be a disruptive force in finance. 
It involves obtaining funds for a project from a large number of different people. It 
is estimated that over $34 billion was raised in this way in 2015. Some crowdfunding 
projects are donation-based with the purpose of funding projects that donors identify 
with and want to see funded. Kickstarter is a well-known platform for handling such 
projects. 

Other crowdfunding platforms are designed to provide a return to investors. Funding 
Circle is a successful British crowdfunding platform where investors lend to small busi- 
nesses. Companisto is a crowdfunding platform designed to provide funds for start-ups 
and high-growth companies. Investors receive either a fixed rate of interest or an equity 
investment. SyndicateRoom provides equity investments in early-stage businesses. 

Crowdfunding has opened up equity investing in private companies to a much wider 
range of investors than venture capitalists and angel investors. The valuation of the com- 
pany is a critical part of equity crowdfunding. If a company is valued at $500,000, then 
shares representing 1% of the company should require an investment of $5,000; if it is 
valued at $1 million, the required investment is $10,000. Sometimes the company deter- 
mines the value and sometimes investors do so. In the latter case, some platforms use expe- 
rienced professionals to carry out due diligence and ensure that an investment is properly 
priced. Ideally, the professional is an angel investor who has some skin in the game. Of 
course, he or she will typically require to be compensated in some way, and it should be 
recognized that, for even an experienced professional, valuations are highly subjectives 

The growth of equity crowdfunding has been slowed by regulation. Most countries 
have strict rules on the issuance of debt and equity securities. In the United States, the 
Jumpstart Our Business Startups (JOBS) Act of 2012 provides exemptions from the 
rules for certain crowdfunding activities. Under the JOBS Act, the maximum amount of 
funding a company can raise in a year is limited to about $1 million, and the maximum 
funds an investor can provide is subject to a limit based on the investor’s income. 


7 Economists refer to this as the “market for lemons” problem following a 1970 paper by George 
Akerlof with that name. 

8 As entertaining television shows such as Shark Tank illustrate, experts do not always agree when 
valuing early-stage businesses. 
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Initial coin offerings (ICOs) are a form of crowdfunding involving cryptocurrencies. 
Typically a company issues to investors its own cryptocurrency in exchange for an estab- 
lished one such as bitcoin or ethereum. The investors have no claim on the company, 
but if the company does well, they expect the cryptocurrency it has issued to increase in 
value. This is because use of the new cryptocurrency is integral to the operations of the 
company. ICOs have attracted regulatory attention. China has banned them, and other 
countries are considering whether they should be regulated in the same way as IPOs. 


28.4 Wealth Management 


Wealth management has traditionally been very profitable for banks. Fees are often in 
the 1% to 1.5% range of the amount invested per year and can be much more when 
hidden fees associated with mutual fund investments and trading costs are taken into 
account. Once a client’s risk appetite has been assessed, wealth management involves find- 
ing appropriate investments for the client. Often similar investments are recommended 
to all clients with similar risk appetites, easing the job of being a wealth manager. It is not 
surprising that wealth management is an industry that is in the process of being disrupted. 
As mentioned in Chapter 4, John Bogle took the first step toward reducing the costs of 
investing with the first index fund in 1975. Index funds have since become very popular 
and charge fees as low as 0.15% with no human intervention in the form of a wealth 
manager being necessary. 

Robo-advisers first appeared in about 2010. In most countries they must register 
with the authorities and are subject to regulation. They provide digital platforms where 
investors express their risk preferences. A portfolio, usually involving ETFs tracking an 
equity index and safe investments, is then chosen, and the portfolio is automatically rebal- 
anced as necessary. There is very little human intervention. Fees are lower than those 
charged by traditional wealth managers, typically 0.50% to 0.75% of the amount invested 
per year. Well-known companies include Wealthfront and Betterment. Some banks and 
other wealth managers are now responding to the competition by offering their own 
automated wealth management services. Indeed, those that fail to do this are unlikely to 
survive. Providers of index mutual funds, such as Vanguard, are also active in this space. 

Robo-advisers are making investment advice available to a much wider range of indi- 
viduals than previously. Often investors can start with as little as $500 or $1,000 whereas 
a traditional wealth manager might require a minimum investment of $50,000. Robo- 
advisers make it easy for clients to add to the funds under management on a regular basis. 
Arguably they serve an important role in society by encouraging people to save when 
they might not otherwise do so. Robo-advising tended to attract young investors with 
small amounts to invest in its early days, but a much wider range of investors, including 
those classified as “high net worth” and HENRYs (high earners not rich yet) are now 
using the services. 

Up to now the main innovation underlying robo-advising has been the delivery 
of services in a cheaper, novel way that many investors find appealing. The investment 
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strategies underlying the advice given are usually similar to those that have been used by 
the investment industry for many years (see Chapter 1). Tax-related strategies (such as 
tax loss harvesting) are often incorporated into the advice that is given.” 

There is plenty of scope for the strategies to become more sophisticated. Investments 
can be better diversified internationally and across sectors. They can be better targeted 
to the goals of the investor, taking into account the investor's age, retirement plans, plans 
to pay for the higher education of children, and so on. 

In 1992, Fischer Black and Robert Litterman at Goldman Sachs published a widely 
used way of incorporating the views of investors in the selection of a portfolio.!® 
Robo-advisers may find ways of expanding the range of alternatives offered to investors 
using this technology. Alternative sets of views with rationales could be presented, with 
investors being invited to choose between them. It might even be possible to let the 
views of the investor be a less structured direct input to the determination of the 
portfolio. 

In Chapter 27, we mentioned that a number of cognitive biases influence decision 
making. Investors are subject to these biases. For example, they are reluctant to sell losers, 
they chase trends, and they get disillusioned and exit equity markets when they should 
stay for the long term. It is often the ability to avoid these biases that distinguishes a 
professional investor from an amateur. Robo-advisers can try to stop investors falling 
victim to the biases by developing innovative ways of explaining them. Finally, robo- 
advising can be combined with other financial innovations so that a percentage of a 
client’s funds is allocated to P2P lending or equity crowdfunding. 

Robo-advising has already become an important part of the financial landscape and is 
likely to become more widely used as the millennial generation accumulates wealth. For 
this generation, it is much cooler to invest with an iPhone than make a trip to the bank. 
However, it is perhaps worth sounding one note of caution. Equity markets performed 
really well in the years following the start of robo-advising in 2010. The appeal of robo- 
advising may decline when there is a downturn and the clients of robo-advisers, who 
never invested before, complain about losing money. It is hoped that the advisers will be 
able to educate investors on the importance of staying focused on the long term. 


28.5 Insurance 


The application of technology in the insurance industry is referred to as insurtech. In some 
respects insurance companies are less likely to be disrupted by technology innovations 
than banks are. This is because the nature of insurance is such that consumers require a 
conservatively run, stable company. For example, it does not make sense to buy house 


? Tax loss harvesting involves selling stocks with capital losses at the year end to reduce the capital 
gains tax payable. 

10 See F Black and R. Litterman, “Global Portfolio Optimization,” Financial Analysts Journal 48, no. 5 
(1992): 28-43. 
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insurance if there is a non-negligible probability that the insurance company will not be 
around in the event of the house being destroyed by fire. Regulators monitor the financial 
condition of insurance companies carefully, and it is very difficult for a new company 
to offer insurance unless it has a large amount of capital to cover claims. However, some 
applications of technology in insurance are straightforward. For example, it is now easy 
to compare the quotes of many different insurance companies online. 

Usually customers do not deal directly with insurance companies. They have to go 
through brokers or agents. These intermediaries are sometimes tied to just one insurance 
company or to a small number of insurance companies, rather than having a free hand to 
find the best deal for clients. A natural development would be disintermediation where 
insurance companies create websites for dealing directly with customers. However, this 
seems to be happening fairly slowly. Insurance companies are in many cases reluctant to 
upset the brokers and agents with whom they have worked closely for many years. But 
eventually as the market develops, brokers and agents are likely to either disappear or 
become digitized providers of price comparisons. 

One important potential insurtech application is the collection of data by insur- 
ance companies so that risks are assessed more accurately. Auto insurance is a good 
example. By placing a “black box” in a car, an insurance company can use telematics 
(the branch of information technology that deals with the long-distance transmission 
of computerized information) to accumulate data on a client’s driving. The black box 
can record driving speeds, distances traveled, the time of day or night that the car is 
being driven, driving style (e.g., frequency of braking and the cornering speed), and the 
type of road usually used. This enables the insurance company to build up an accu- 
rate profile of the driver. If the driver is less risky than average, the insurance premium 
charged will go down. Similarly, if the driver is more risky than average, it will go up. 
Black box insurance is still in its infancy, although it has made some inroads into the 
UK insurance market. As the cost of car insurance rises, it may become more popular, 
particularly with young people. The latter have traditionally faced very high premiums 
when they have little or no driving record. A black box could provide a quicker way 
for them to convince insurance companies that they are good drivers. Another point to 
note is that the cost of a black box (which is typically paid for by the driver) has come 
down. 

The black box can give the driver information on road conditions. It can also be 
used for driver education. The driver may be able to access a secure website to find out 
how well he or she is performing. This can suggest changes to driving style and provide 
tips on bringing down the cost of insurance. In principle, this could improve the average 
ability of drivers and make roads safer. Of course not everyone can expect a reduced 
premium as a result of a black box, and some drivers may regard them as an invasion of 
privacy. One can speculate that as black boxes become more widely accepted, only good 
drivers will use them and the premiums for non-users will increase. 

To the insurance company, the key advantage of a black box is that it is able to 
collect data allowing risks to be assessed more accurately. In an ideal world, an insurance 
premium charged should reflect miles driven and the risks incurred by the driver per 
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mile.'! Another potential advantage of black boxes to the insurance company is that they 
may be able to pinpoint blame in the event of an accident and make claims simpler. 

A general point is that insurance companies now have opportunities to collect more 
data than ever before to assess the risks they are taking. These data can potentially allow 
them to improve estimates of an individual’s life expectancy, the probability of a partic- 
ular house burning down, or the chance of a hurricane in a certain part of the world. 
Machine learning provides the tools to analyze the data. The insurance industry is by 
nature conservative (perhaps rightly so), and it is difficult to imagine insurtech radically 
changing the structure of the industry. (In many cases, new entrants have to partner with 
established insurers.) But the companies that prosper are likely to be those that take 
advantage of the opportunities created by what is sometimes referred to as “big data.” 


28.6 Regulation and Compliance 


There are two aspects of the impact of technology on regulation. One is how regulators 
should handle companies that use technology in innovative ways and have the potential 
to be a disruptive force in finance. The other, known as Reg Tech, is concerned with how 
regulation itself can use technology. We will consider both in this section. 


28.6.1 Regulation of Financial Innovators 


As we have seen in this book, financial institutions are highly regulated. They have to 
deal with many different regulatory agencies. This is a barrier to entry for small FinTech 
start-ups. P2P lenders in the United States must enter into an arrangement with a bank 
to comply with regulations. Robo-advisers must satisfy know-your-client rules, even 
though there is limited or no human interaction in their services, and register with the 
authorities. If a FinTech ignores regulations, it is likely to be closed down by supervisory 
authorities, at least temporarily, and suffer reputational damage. 

Regulators do not want to stifle financial innovation. Indeed, as we have pointed out, 
financial innovation can bring many benefits to society by developing innovative ways 
of saving, investing, and borrowing so that more people are encouraged to be part of 
the financial ecosystem. But regulators must tread a fine line. They do not want shadow 
banks (i.e., non-banks that are offering similar services to banks) to subject clients to 
unreasonable risks. 

The Financial Conduct Authority (FCA) in the United Kingdom has developed a 
“sandbox” program.'* It defines a sandbox as a safe space in which businesses can test 


11 Metromile, a San Francisco company, is an example of an insurance company moving in this direc- 
tion. It uses a gadget to monitor mileage driven and sends a monthly invoice for insurance to pol- 
icy holders to reflect the records of miles driven. The policies are actually written by a large well- 
capitalized and highly regulated insurance company, National General Insurance. 

!2See Financial Conduct Authority, “Regulatory Sandbox,” November 2015. 
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innovative products, services, business models, and delivery mechanisms without imme- 
diately incurring all the normal regulatory consequences of pilot activities. Innovators 
must fill out an application form explaining what they are planning to do. The FCA 
encourages virtual testing before real consumers are used and has considered four other 
ways in which consumers can be protected: 


1. Consumers must give informed consent to be included in testing. They are notified 
of risks and available compensation. 

2. The FCA agrees, on a case-by-case basis, to the disclosure and compensation. 

3. Customers have the same rights as customers of other firms that are established and 
authorized by the FCA. 

4. The businesses undertaking sandbox trials must compensate individuals for any losses 
and must demonstrate that it has the resources to do this. 


The FCA seems to prefer the second approach, which can include one or more of the 
other approaches. 

Despite pressure from technology companies and from forward-looking established 
financial institutions, the United States does not have a sandbox program. Some contend 
that this could put the country at a competitive disadvantage. As mentioned earlier, it 
does have the JOBS Act, which eases the regulatory burden on crowdfunding activities. 

Regulating FinTech companies is likely to be a challenge for regulators in the future. 
The main asset of a FinTech company is often its computer code, and it is not at all 
easy for supervisors to evaluate this in the same way that they would evaluate how a 
bank does its risk management. High-frequency traders (HFTs) illustrate the problem. 
Their trading activities are determined entirely by their computer code. On May 6, 2010, 
largely as a result of the activities of HFTs, a “flash crash” occurred in the United States 
where the Dow Jones Industrial Average declined by about 9% in a matter of minutes 
before partially recovering. On August 1, 2012, a software error in the program used 
by an HFT caused a major disruption in the prices of 148 stocks listed on the New 
York Stock Exchange. The Securities and Exchange Commission has taken some steps 
to avoid similar problems in the future, but it is difficult for it to guard against all the 
ways in which software can (intentionally or unintentionally) interfere adversely with 
the functioning of financial markets. 

Another issue for regulators is that the jurisdiction may be difficult to determine for 
some FinTech companies. This is particularly likely to be the case if all of its transactions 
are carried out in digital currencies. 


28.6.2 RegTech 


RegTech is the application of technology to regulation and compliance. Some of the 
applications are straightforward. Regulatory reporting has traditionally involved thou- 
sands of pages of documents being provided to regulators at the end of each reporting 
period (e.g., every quarter). Moving this online and providing data in real time have the 
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potential to simplify the job of both the regulator and the regulated financial institution, 
while making regulation more timely and efficient. 

Fines for failing to comply with regulations can be steep.!? For example, in Chapter 
23 we mentioned that a French bank, BNP Paribas, was fined an amount equal to its 
profits for a whole year for violating U.S. sanctions. No doubt this happened because the 
activities of a small number of employees were not being monitored closely by the bank’s 
compliance group. Technology can make it easier for a bank to ensure that it complies 
with regulations and the laws of the different countries it operates in. For example, tech- 
nology can make it possible for it to screen both new customers and new transactions 
in real time to identify problems. Compliance violations involving issues such as drug 
money laundering, terrorist financing, and sanctions can be effectively eliminated if a 
real-time RegTech application, approved by regulators and reflecting the latest rules, is 
used. 

Preventing staff from engaging in questionable behavior is important to financial 
institutions, as the behavior can prove to be very costly in terms of regulatory fines, 
legal costs, and reputation. Some innovations allow companies to keep track of their 
employees’ conversations across a number of communications platforms. For example, 
Digital Reasoning, a company headquartered in Nashville, with offices in London, New 
York, and Washington, has created surveillance software used by some of the largest 
banks and asset management companies. It can monitor English conversations (handling 
six different dialects). It can monitor employee behavior by analyzing millions of e-mails, 
chat logs, and phone conversations to detect suspicious or unusual activity using machine 
learning. If machine learning detects an employee whose behavior is quite different from 
the usual pattern, further investigation might be warranted. In an asset management firm, 
the behavior might prove to be indicative of insider trading, which, if allowed to continue, 
could lead to a huge fine. In a bank, it might prove to be indicative of rogue trading or 
a failure to treat subordinates with respect. 

As mentioned in Chapter 23, cyber risk is the biggest operational risk for banks. It is 
also becoming a regulatory issue. For example, the General Data Protection Regulation 
in Europe begins in May 2018 and will see financial institutions charged hefty fines if 
they lose data meant to remain confidential. Some RegTech applications are helping 
banks keep their data secure and comply with regulations like this one. 

Just knowing all the regulations that exist is a major problem for financial institutions, 
as the regulatory environment it operates in throughout the world changes almost daily. 
IBM’s Watson Regulatory Compliance gives relevant employees access to a library of 
regulatory requirements in which materials can be filtered geographically, by business 
type, by product, and by compliance area. This application can automatically highlight 
the parts of a new regulation that are relevant to a particular issue and compare the 
regulations related to a particular activity in different parts of the world. 


'5Between the end of the 2007-2008 crisis and 2017, banks throughout the world have paid about 
$321 billion in fines. Technology to improve a bank’s compliance can therefore have a huge payoff. 
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As explained in Part Three of this book, regulation has moved from being unidi- 
mensional to being multidimensional. Regulators used to focus on only a single capital 
measure. They now have two capital measures (one based on risk-weighted assets and 
the other based on a simpler leverage calculation) and two liquidity ratios (the liquidity 
coverage ratio and the net stable funding ratio). Different activities undertaken by the 
bank have different effects on the ratios. This leads to the potential for technology firms 
to come up with ways of optimizing bank strategies or providing convenient scenario 
analysis tools. 


28.7 How Should Financial Institutions Respond? 


FinTech innovations pose a strategic risk for large banks and other financial institutions. 
Many of the services they offer will be disrupted. How should they respond? Clearly they 
should embrace technological change, rather than hoping it is a fad that will disappear. 
But they should also carefully evaluate how consumer behavior is being affected by 
technological change and take steps to change their business model accordingly. 

Business Snapshot 28.2 describes Eastman Kodak, a company that did not survive 
technology changes well. A key point is that Kodak was not unaware of the changes 
taking place in its industry. Indeed, the first digital camera was created in 1975 by an 
engineer working for Kodak, and Kodak invested billions in the new technology. So 
where did Kodak go wrong? It understood the new technology but did not appreciate 
the way it was changing consumer behavior until too late. In the course of a short period 
of time, a large segment of the population (not just young people) became comfortable 
sharing photographs and storing them on computers. They no longer felt the need to 
print photographs. 

Kodak coined the term “Kodak moment,’ which it used extensively in its advertis- 
ing to convince people that they should have a Kodak camera loaded with Kodak film 
ready to use at all times. It is easy to be wise after the event, but strategy professionals 
would argue that the company could have extrapolated from its sales pitch to recognize 
the business it was in. It was in the imaging or moment-sharing business rather than 
the film business. We will never know how well it would have survived if it had more 
enthusiastically embraced electronic sharing and the storing of digital images when the 
market was in its early stages. (As indicated in Business Snapshot 28.2, Fuji, which was a 
similar company to Kodak but based in Japan, fared better than Kodak.) It was its implicit 
belief that demand for hard-copy photographs would continue that ultimately doomed 
the company. 

The disruption of large financial institutions does not seem to be happening as 
quickly as the disruption of Kodak. They have a number of competitive advantages. 
They are well capitalized (although the same is true of Apple, Google, and Alibaba). 
They understand how to deal with the highly regulated environment they operate in 
(something many FinTech start-ups find difficult). They have a huge customer base 
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BUSINESS SNAPSHOT 28.2 
Eastman Kodak and the Digitization of Photography 


In the 1980s and 1990s, the Eastman Kodak company was a successful business 
selling cameras and film for cameras. It had almost single-handedly transformed 
photography in the United States from something that was the exclusive preserve 
of professional photographers to something that almost everyone did. It had 
successfully navigated technological changes in its industry. It moved from dry- 
plate to black-and-white film and then sensibly invested in color film even when 
it was demonstrably inferior to the black-and-white film from which it derived 
most of its revenue. Its market capitalization in 1997 was about $30 billion. 

Steve Sasson, an engineer at Kodak, invented the first digital camera in 1975. 
According to Sasson, the company’s response was “That’s cute—but don’t tell 
anyone about it.’ In fact, Kodak did not ignore the trend to digital cameras. 
It invested heavily and in 1995 it brought its first digital camera to market, 
the DC40. In 2001, it launched the EasyShare line of point-and-shoot cameras. 
Kodak had a first-rate research capability. It had the ability to develop state-of- 
the-art cameras that could enhance colors, automatically adjust lighting, organize 
portfolios, share photos, detect smiles to get the best shots, and so on. 

Unfortunately, Kodak exhibited some reluctance to continually improve 
products and innovate, a mistake in an industry where technology changes fast. 
Cannibalizing existing products in the name of progress is painful, but necessary 
(even when the new products are less profitable than the old ones). How well 
Kodak would have survived the smartphone revolution if it had adopted a differ- 
ent strategy is debatable. All we can say is that some of its competitors such as Fuji 
have fared better. In 2017, Fuji’s market capitalization was more than $15 billion. 
Kodak filed for bankruptcy at the end of 2012, and its successor company had a 
market capitalization of about $300 million in 2017. 


and are still trusted by many customers (although it is important to recognize that 
the 2007—2008 financial crisis and events since then such as the Wells Fargo debacle 
described in Business Snapshot 27.1 have served to erode this trust). One can speculate 
that financial institutions are not as vulnerable as Kodak in that many people are less 
inclined to experiment with the way their money is handled than with the way they 
do photography. Also, many start-ups need established financial institutions to offer their 
products. 

However, there are some warning signs that banks should respond to. The Millennial 
Disruption Index survey indicated that 71% of millennials would rather visit the den- 
tist than listen to what banks are saying, while 73% would rather handle their financial 
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services needs through Google, Amazon, Apple, PayPal, or Square.'+ Millennials also 
voted four leading U.S. banks among their least loved brands. Millennials are individ- 
uals born between 1981 and 2000 and are important future consumers of bank services. 
By 2022 they will make up 40% of the workforce and by 2050 they will have inherited 
about $30 trillion. They may feel less radically inclined about financial services as they 
age, but banks should not rely on this. 

Kodak was ultimately rendered largely irrelevant by the digital cameras incorpo- 
rated in smartphones. Financial institutions naturally do not want to become similarly 
irrelevant. They have recognized the need to offer mobile apps for payments, wealth 
management, and a host of other services. But it is important for them to embrace tech- 
nological change, not just to pay lip service to it. Technological change in the financial 
sector will continue at an accelerating rate. In many cases it will erode the profits banks 
previously relied upon (as was the case for Kodak). Being flexible enough to adjust will 
be a continuing challenge. 

The new services developed by banks need to be convenient and designed so that 
young people classify them as “cool” while older people find them easy to use. There are 
a number of different approaches being adopted. Some financial institutions have devel- 
oped new services in-house; some have bought start-ups that have already developed 
the services; and some have entered into partnerships with start-ups. The first alterna- 
tive, although the least expensive and most appealing to many working in the financial 
services sector, can be quite difficult given the somewhat complacent culture that often 
permeates large companies. The second and third alternatives can be used as a way of 
disrupting the culture and accelerating change. Some banks have found it useful to cre- 
ate an organizationally distinct unit that has the ability to bring in outside talent when 
necessary and can partner with start-ups. Whatever approach is adopted, strong support 
from the CEO and the board is necessary for success. 

The financial institutions that prosper will be the ones that manage to change their 
culture so that they are able to provide services in a fast, flexible way that appeals to 
consumers. Lou Gerstner was appointed CEO at IBM in the 1990s when the company 
was losing money. He successfully changed the culture and transformed it into a profitable 
company. He is quoted as saying, “Culture isn’t just one aspect of the game. It is the game.” 
He successfully changed IBM from being a supplier of mainframe computers to being a 
business problem solver where teams of people worked well together. Banks are similar 
to IBM in that they are huge organizations with many bright employees. The job of the 
CEO is to create a culture where individuals are rewarded for embracing technology and 
finding innovative ways of making their services more convenient and valuable for clients. 


Summary 


What will the financial services sector look like in, say, 2035? Of course, the pace of 
technological change is so fast that accurate forecasts are impossible. However, offering 


l4See www.millennialdisruptionindex.com. 
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a few predictions is a convenient way of summarizing some of the points made in this 
chapter. 

Checks and credit cards will largely disappear. Mobile wallets on smartphones and 
wearables will become the norm. Biometrics will be used to make payments more secure. 
Some central banks will choose to switch from paper currencies to digital currencies, and 
transacting in digital currencies will become more widespread. Machine learning will be 
able to undertake many tasks such as credit assessment and fraud detection much better 
than human beings. Record keeping will be more secure and faster using blockchains 
and other DLTs. 

Large insurance companies will continue to exist, but brokers and agents will not. 
Insurers will have access to more data than ever before and will use machine learning and 
other techniques to assess risks more precisely. There will be partnerships formed between 
insurance companies and technology firms to deliver insurance services in innovative 
ways. 

Investing and borrowing will see big changes. Crowdfunding and P2P lending will 
become more widely used. Some of today’s large banks will find it necessary to offer 
these services to savers and borrowers. It will be unusual for funds to remain idle for any 
length of time, and the ease of borrowing and investing will make more people part of 
the financial ecosystem. 

Some banks that fail to adjust to new technology enthusiastically will fail and be 
acquired by other banks. (Indeed, too-big-to-fail may no longer be considered a relevant 
descriptor of very large banks.) Some large technology firms will become banks and 
offer a full range of banking services. Most bank services will be offered more cheaply 
and more efficiently than before so that banks will lose some of their traditional sources 
of profit. The banks that survive will have to cut costs by making big reductions in the 
number of branches they have and the number of people they employ. Banks will have to 
form partnerships with many different technology firms, to keep the services they offer 
up to date. 
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Practice Questions and Problems 
(Answers at End of Book) 


28.1 Explain the terms disintermediation and reintermediation. 
28.2 What is meant by (a) machine learning and (b) distributed ledger technologies? 
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28.3 What is meant by biometric authorization? Give some examples. 

28.4 Why might a central bank switch from fiat money to digital money? 

28.5 Explain how P2P lending and equity crowdfunding work. 

28.6 “When thinking about the impact of FinTech, financial institutions should be 
thankful for regulation.” Explain. 

28.7 “Index funds are a great idea, but if everyone chose them there would be no price 
discovery.’ Do you agree with this argument? 

28.8 What is meant by RegTech? Give some examples. 

28.9 “Banks should aim to respond to change like IBM, not like Kodak.” Discuss this 
statement. 


Further Questions 


28.10 What is meant by a bubble? Consider whether the increase in the price of bitcoin 
in 2017 is a bubble. 

28.11 Look at the data in Table 28.1. Is Lending Club good at assessing risk? Is there a 
reasonable trade-off between risk and return for lenders? What risks are lenders 
taking? 

28.12 With the benefit of hindsight, we can say that Kodak was in the imaging and 
moment-sharing business. What business are banks in? 


Chapter 29 


Risk Management Mistakes 
to Avoid 


ince the mid-1980s, there have been some spectacular losses in financial markets. 
S This chapter uses them to review some of the key points made in this book. The 
losses that we will consider are listed in Business Snapshot 29.1. 

One remarkable aspect of the list in Business Snapshot 29.1 is the number of times 
huge losses were caused by the activities of a single person. In 1995, Nick Leeson’s trading 
brought a 200-year-old British bank, Barings, to its knees; in 1994, Robert Citron’s trad- 
ing led to Orange County, a municipality in California, losing about $2 billion. Joseph 
Jett’s trading for Kidder Peabody caused losses of $350 million. John Rusnak’s losses of 
$700 million at Allied Irish Bank came to light in 2002. Jérôme Kerviel lost over $7 
billion for Societé Générale in 2008. Kweku Adoboli lost $2.3 billion for UBS in 2011. 

The losses emphasize the importance of internal controls. Many of the losses we 
will consider occurred because systems were inadequate so that the risky positions being 
taken were simply not known. It is also important for risk managers to continually “think 
outside the box” about what could go wrong so that as many potential adverse events as 
possible are identified. 


29.1 Risk Limits 


The first and most important lesson from the losses concerns risk limits. It is essential that 
all companies (financial and nonfinancial) define in a clear and unambiguous way limits 
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BUSINESS SNAPSHOT 29.1 
Big Losses 


Allied Irish Bank: This bank lost about $700 million from the unauthorized 
speculative activities of one of its foreign exchange traders, John Rusnak, which 
lasted for a number of years. Rusnak covered up his losses by creating fictitious 
options trades. 

Barings: This 200-year-old British bank was wiped out in 1995 by the activ- 
ities of one trader, Nick Leeson, in Singapore. The trader’s mandate was to arbi- 
trage between Nikkei 225 futures quotes in Singapore and Osaka. Instead he 
made big bets on the future direction of the Nikkei 225 using futures and options. 
The total loss was close to $1 billion. 

Enron’s Counterparties: Enron managed to conceal its true situation from 
its shareholders with some creative contracts. Several financial institutions that 
allegedly helped Enron do this have each had to settle shareholder lawsuits for 
over $1 billion. 

Hammersmith and Fulham (see Business Snapshot 23.1): This British Local 
Authority lost about $600 million on sterling interest rate swaps and options in 
1988. The two traders responsible for the loss knew surprisingly little about the 
products they were trading. 

Kidder Peabody (see Business Snapshot 25.2): The activities of a single trader, 
Joseph Jett, led to this New York investment dealer losing $350 million trading 
U.S. government securities. The loss arose because of a mistake in the way the 
company’s computer system calculated profits. 

Long-Term Capital Management (see Business Snapshot 22.1): This hedge fund 
lost about $4 billion in 1998 carrying out convergence arbitrage strategies. The 
loss was caused by a flight to quality after Russia defaulted on its debt. 

National Westminster Bank: This British bank lost about $130 million from 
using an inappropriate model to value swap options in 1997. 

Orange County (see Appendix B): The activities of the treasurer, Robert Citron, 
led to this California municipality losing about $2 billion in 1994. The treasurer 
was using derivatives to speculate that interest rates would not rise. 

Procter & Gamble (see Business Snapshot 5.4): The treasury department of 
this large U.S. company lost about $90 million in 1994 trading highly exotic 
interest rate derivatives contracts with Bankers Trust. It later sued Bankers Trust 
and settled out of court. 

Société Générale (see Business Snapshot 5.5): Jérôme Kerviel, an equity trader 
in the Paris office, lost over $7 billion speculating on movements in equity indices 
in January 2008. He is alleged to have concealed his exposure by creating ficti- 
tious trades. Like Leeson at Barings, his mandate was to do arbitrage trades. 
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Subprime Mortgage Losses (see Chapter 6): In 2007, investors lost confidence 
in the structured products created from U.S. subprime mortgages. This led to a 
“credit crunch” with losses of tens of billions of dollars by financial institutions 
and the worst recession since the 1930s. 

UBS: In 2011, Kweku Adoboli lost $2.3 billion taking unauthorized specu- 
lative positions in stock market indices. 


to the financial risks that can be taken. They should then set up procedures for ensuring 
that the limits are adhered to. Ideally, overall risk limits should be set at board level. These 
should then be converted to limits applicable to the individuals responsible for managing 
particular risks. Daily reports should indicate the gain or loss that will be experienced 
for particular movements in market variables. These should be checked against the actual 
gains and losses that are experienced to ensure that the valuation procedures underlying 
the reports are accurate. 

It is particularly important that companies monitor risks carefully when derivatives 
are used. This is because derivatives can be used for hedging or speculation or arbitrage. 
Without close monitoring, it is impossible to know whether a derivatives trader has 
switched from being a hedger to a speculator or switched from being an arbitrageur to 
being a speculator. The Barings, Société Générale, and UBS losses are classic examples of 
what can go wrong. In each case, the trader’s mandate was to carry out low-risk arbitrage 
and hedging. Unknown to their superiors, they switched from being arbitrageurs and 
hedgers to taking huge bets on the future direction of market variables. The systems 
within the banks were inadequate and did not detect what was going on. 

The argument here is not that no risks should be taken. A trader in a financial insti- 
tution or a fund manager should be allowed to take positions on the future direction of 
relevant market variables. What we are arguing is that the sizes of the positions that can 
be taken should be limited and the systems in place should accurately report the risks 
being taken. 


29.1.1 A Difficult Situation 


What happens if an individual exceeds risk limits and makes a profit? This is a tricky issue 
for senior management. It is tempting to ignore violations of risk limits when profits 
result. However, this is shortsighted. It leads to a culture where risk limits are not taken 
seriously, and it paves the way for a disaster. The classic example here is Orange County. 
Robert Citron’s activities in 1991—1993 had been very profitable for Orange County, and 
the municipality had come to rely on his trading for additional funding. People chose to 
ignore the risks he was taking because he had produced profits. Unfortunately, the losses 
made in 1994 far exceeded the profits from previous years. 
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The penalties for exceeding risk limits should be just as great when profits result as 
when losses result. Otherwise, traders who make losses are liable to keep increasing their 
bets in the hope that eventually a profit will result and all will be forgiven. 


29.1.2 Do Not Assume You Can Outguess the Market 


Some traders are quite possibly better than others. But no trader gets it right all the time. 
A trader who correctly predicts the direction in which market variables will move 60% 
of the time is doing well. If a trader has an outstanding track record (as Robert Citron 
did in the early 1990s), it is likely to be a result of luck rather than superior trading skill. 
As our discussion of mutual fund performance in Chapter 4 shows, it appears that fund 
managers usually produce superior returns as a result of luck rather than skill. 

Suppose that a financial institution employs 16 traders and one of those traders makes 
profits in every quarter of a year. Should the trader receive a good bonus? Should the 
trader’s risk limits be increased? The answer to the first question is that the trader will 
inevitably receive a good bonus. The answer to the second question should be no. The 
chance of making a profit in four consecutive quarters from random trading is 0.54 or 
1 in 16. This means that just by chance one of the 16 traders will “get it right” every 
single quarter of the year. We should not assume that the trader’s luck will continue and 
we should not increase the trader’s risk limits. 


29.1.3 Do Not Underestimate the Benefits of Diversification 


When a trader appears good at predicting a particular market variable, there is a tendency 
to increase the trader’s risk limits. We have just argued that this is a bad idea because it 
is quite likely that the trader has been lucky rather than clever. However, let us suppose 
that we are really convinced that the trader has special talents. How undiversified should 
we allow ourselves to become in order to take advantage of the trader’s special skills? As 
indicated in Section 1.1, the benefits from diversification are large. A trader has to be 
very good for it to be worth foregoing these benefits to speculate heavily on just one 
market variable. 

An example will illustrate the point here. Suppose that there are 20 stocks, each of 
which has an expected return of 10% per annum and a standard deviation of return of 
30%. The correlation between the returns from any two of the stocks is 0.2. By dividing 
an investment equally among the 20 stocks, an investor has an expected return of 10% per 
annum and standard deviation of returns of 14.7%. Diversification enables the investor 
to reduce risks by over half. Another way of expressing this is that diversification enables 
an investor to double the expected return per unit of risk taken. The investor would 
have to be very good at stock picking to consistently get a better risk-return trade-off 
by choosing undiversified stock portfolios. 


29.1.4 Carry Out Scenario Analyses and Stress Tests 


As discussed in Chapter 22, the calculation of risk measures such as VaR should always be 
accompanied by scenario analyses and stress testing to obtain an understanding of what 
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can go wrong. Without the discipline of stress testing, human beings have an unfortunate 
tendency to anchor on one or two scenarios when evaluating decisions. In 1993 and 1994, 
for example, Procter & Gamble was so convinced that interest rates would remain low 
that it ignored the possibility of a 100-basis-point increase in its decision making. 

Once stress-testing results have been produced, they should become inputs to a finan- 
cial institution’s strategic decision making. All too often, particularly when times are good, 
the results of stress testing are ignored. This happened at a number of financial institutions 
prior to July 2007. 


29.2 Managing the Trading Room 


š 


In trading rooms there is a tendency to regard high-performing traders as “untouchable’ 
and to not subject their activities to the same scrutiny as other traders. Apparently Joseph 
Jett, Kidder Peabody’s star trader of Treasury instruments, was often “too busy” to answer 
questions and discuss his positions with the company’s risk managers. 

All traders—particularly those making high profits—should be fully accountable. It is 
important for the financial institution to know whether the high profits are being made 
by taking unreasonably high risks. It is also important to check that the financial institu- 
tion’s computer systems and pricing models are correct and are not being manipulated 
in some way. 


29.2.1 Separate the Front, Middle, and Back Office 


The front office in a financial institution consists of the traders who are executing trades, 
taking positions, and so on. The middle office consists of risk managers who are monitoring 
the risks being taken. The back office is where the record keeping and accounting takes 
place. Some of the worst derivatives disasters have occurred because these functions were 
not kept separate. Nick Leeson controlled both the front and back office for Barings in 
Singapore and was, as a result, able to conceal the disastrous nature of his trades from his 
superiors in London for some time. 


29.2.2 Do Not Blindly Trust Models 


We discussed model risk in Chapter 25. Some of the large losses experienced by finan- 
cial institutions arose because of the models and computer systems being used. Kidder 
Peabody was misled by its own systems. Another example of an incorrect model leading 
to losses is provided by National Westminster Bank. This bank had an incorrect model 
for valuing swap options that led to a large loss. 

If large profits are reported when relatively simple trading strategies are followed, 
there is a good chance that the models underlying the calculation of the profits are 
wrong. Similarly, if a financial institution appears to be particularly competitive on its 
quotes for a particular type of deal, there is a good chance that it is using a different 
model from other market participants—which almost certainly means that, before too 
long, it will have to change its model and report a loss. Getting too much business of a 
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certain type can be just as worrisome to the head of a trading room as getting too little 
business of that type. 


29.2.3 Be Conservative in Recognizing Inception Profits 


When a financial institution sells a highly exotic instrument to a nonfinancial corpo- 
ration, the valuation can be highly dependent on the underlying model. For example, 
instruments with long-dated embedded interest rate options can be highly dependent 
on the interest rate model used. In these circumstances, a phrase used to describe the 
daily marking to market of the deal is marking to model. This is because there are no 
market prices for similar deals that can be used as a benchmark. 

Suppose that a financial institution manages to sell an instrument to a client for $10 
million more than it is worth—or at least $10 million more than its model says it is worth. 
The $10 million is known as an inception profit. When should it be recognized? There 
appears to be a lot of variation in what different derivatives dealers do. Some recognize 
the $10 million immediately, whereas others are much more conservative and recognize 
it slowly over the life of the deal. 

Recognizing inception profits immediately is very dangerous. It encourages traders 
to use aggressive models, take their bonuses, and leave before the model and the value of 
the deal come under close scrutiny. It is much better to take reserves and to recognize 
inception profits slowly, so that traders are motivated to investigate the impact of several 
different models and several different sets of assumptions before committing themselves 
to a deal. 


29.2.4 Do Not Sell Clients Inappropriate Products 


It is tempting to sell corporate clients inappropriate products, particularly when they 
appear to have an appetite for the underlying risks. But this is shortsighted. A dramatic 
illustration is provided by the activities of Bankers Trust (BT) in the period leading up 
to the spring of 1994. BT’s clients, such as Procter & Gamble, were persuaded to buy 
high-risk and totally inappropriate products. A typical product would give a client a 
good chance of saving a few basis points on its borrowings and a small chance of losing 
a large amount of money. The products worked well for BT’s clients in 1992 and 1993, 
but blew up in 1994 when interest rates rose sharply. The bad publicity that followed 
hurt BT greatly. The years it had spent building up trust among corporate clients and 
developing an enviable reputation for innovation in derivatives were largely lost as a 
result of the activities of a few overly aggressive salesmen. BT was forced to pay large 
amounts of money to its clients to settle lawsuits out of court. It was taken over by 
Deutsche Bank in 1999. 


29.2.5 Beware of Easy Profits 


Enron provides an example of how overly aggressive deal makers can cost their banks 
billions of dollars. Doing business with Enron seemed very profitable and banks 
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competed with each other for this business. But the fact that many banks push hard 
to get a certain type of business should not be taken as an indication that the business 
will ultimately be profitable. The business that Enron did with banks resulted in 
many very expensive shareholder lawsuits. In general, transactions where high profits 
seem easy to achieve should be looked at closely for potential operational, credit, or 
market risks. 

Investing in the AAA-rated tranches of ABSs and ABS CDOs that were created from 
subprime mortgages (see Chapter 6) seemed like a money-making machine for many 
banks. The promised returns were higher than the returns normally earned on AAA- 
rated instruments. Most traders did not stop to ask whether this was because there were 
risks that had not been taken into account by rating agencies. 


29.3 Liquidity Risk 


We discussed liquidity risk in Chapter 24. Financial engineers usually base the pricing of 
exotic instruments and other instruments that trade relatively infrequently on the prices 
of actively traded instruments. For example: 


1. A trader often calculates a zero curve from actively traded government bonds (known 
as on-the-run bonds) and uses it to price bonds that trade less frequently (off-the-run 
bonds). 

2. A trader often implies the volatility of an asset from actively traded options and uses 
it to price less actively traded options. 

3. A trader often implies information about the behavior of interest rates from actively 
traded interest rate caps and swap options and uses it to price products that are highly 
structured. 


These practices are not unreasonable. However, it is dangerous to assume that less 
actively traded instruments can always be traded at close to their theoretical prices. When 
financial markets experience a shock of one sort or another, liquidity black holes may 
develop (see Section 24.3). Liquidity then becomes very important to investors, and 
illiquid instruments often sell at a big discount to their theoretical values. 

An example of liquidity risk is provided by Long-Term Capital Management 
(LT'CM), which is the subject of Business Snapshot 22.1. This hedge fund followed a 
strategy known as convergence arbitrage. It attempted to identify two securities (or portfo- 
lios of securities) that should in theory sell for the same price. If the market price of one 
security was less than that of the other, it would buy that security and sell the other. The 
strategy is based on the idea that if two securities have the same theoretical price, their 
market prices should eventually be the same. 

In the summer of 1998, LTCM took a huge loss. This was largely because a default 
by Russia on its debt caused a flight to quality. LTCM tended to be long illiquid instru- 
ments and short the corresponding liquid instruments. (For example, it was long off- 
the-run bonds and short on-the-run bonds.) The spreads between the prices of illiquid 
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instruments and the corresponding liquid instruments widened sharply after the Russian 
default. LTCM was highly leveraged. It experienced huge losses and there were margin 
calls on its positions that it had difficulty meeting. 

The LTCM story reinforces the importance of carrying out scenario analyses and 
stress testing to look at what can happen in extreme scenarios. As discussed in Chapter 22, 
it is important to consider not only immediate losses but also losses created by knock-on 
effects. 


29.3.1 Beware When Everyone Is Following the Same Trading Strategy 


It sometimes happens that many market participants are following essentially the 
same trading strategy. This creates a dangerous environment where there are liable 
to be big market moves, liquidity black holes, and large losses for the market 
participants. 

We gave one example of this in Business Snapshot 24.4, which discussed portfolio 
insurance and the market crash of October 1987. In the months leading up to the crash, 
increasing numbers of portfolio managers were attempting to insure their portfolios by 
creating synthetic put options. This involved buying stocks or stock index futures after 
a rise in the market and selling them after a fall. The result was an unstable market. A 
relatively small decline in stock prices could lead to a wave of selling by portfolio insurers. 
The latter would lead to a further decline in the market, which could give rise to another 
wave of selling, and so on. There is little doubt that, without portfolio insurance, the crash 
of October 1987 would have been much less severe. 

Another example is provided by LTCM in 1998. Its position was made more difficult 
by the fact that many other hedge funds were following similar convergence arbitrage 
strategies. After the Russian default and the flight to quality, LTCM tried to liquidate part 
of its portfolio to meet margin calls. Unfortunately, other hedge funds were facing similar 
problems to LTCM and trying to do similar trades. This exacerbated the situation, causing 
liquidity spreads to be even higher than they would otherwise have been and reinforcing 
the flight to quality. Consider, for example, LTCM’s position in U.S. Treasury bonds. It 
was long the illiquid off-the-run bonds and short the liquid on-the-run bonds. When 
a flight to quality caused spreads between yields on the two types of bonds to widen, 
LTCM had to liquidate its positions by selling oft-the-run bonds and buying on-the-run 
bonds. Other large hedge funds were doing the same. As a result, the price of on-the-run 
bonds rose relative to off-the-run bonds and the spread between the two yields widened 
even more. 

A further example is provided by British insurance companies in the late 1990s. This 
is discussed in Business Snapshot 3.1. All insurance companies decided to hedge their 
exposure to a fall in long-term rates at about the same time. The result was a fall in 
long-term rates! 

The key lesson to be learned from these stories is that it is important to see 
the big picture of what is going on in financial markets and to understand the risks 
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inherent in situations where many market participants are liable to follow the same trad- 
ing strategy. 


29.3.2 Do Not Make Excessive Use of Short-Term Funding for 
Long-Term Needs 


All financial institutions finance long-term needs with short-term sources of funds to 
some extent. But a financial institution that relies too heavily on short-term funds is 
likely to expose itself to unacceptable liquidity risks. 

During the period leading up to the credit crisis of 2007, there was a tendency 
for subprime mortgages and other long-term assets to be financed by commercial paper 
while they were in a portfolio waiting to be packaged into structured products. Conduits 
and special purpose vehicles had an ongoing requirement for this type of financing. 
The commercial paper would typically be rolled over every month. For example, the 
purchasers of commercial paper issued on April 1 would be redeemed with the proceeds 
of a new commercial paper issue on May 1; this new commercial paper issue would in 
turn be redeemed with another new commercial paper issue on June 1;and so on. When 
investors lost confidence in subprime mortgages in August 2007, it became impossible 
to roll over commercial paper. In many instances, banks had provided guarantees and had 
to provide financing. This led to a shortage of liquidity. As a result, the credit crisis was 
more severe than it would have been if longer-term financing had been arranged. 

Many of the failures of financial institutions during the crisis (e.g., Lehman Brothers 
and Northern Rock) were caused by excessive reliance on short-term funding. Once 
the market (rightly or wrongly) becomes concerned about the health of a financial insti- 
tution, it can be impossible to roll over the financial institution’s short-term funding. 
The Basel Committee has recognized the importance of liquidity risks by introducing 
liquidity requirements in Basel III. 


29.3.3 Market Transparency Is Important 


One of the lessons from the credit crisis of 2007 is that market transparency is important. 
During the period leading up to 2007, investors had been trading highly structured 
products without any real knowledge of the underlying assets. All they knew was the 
credit rating of the security being traded. With hindsight, we can say that investors should 
have demanded more information about the underlying assets and should have more 
carefully assessed the risks they were taking. 

The subprime meltdown of August 2007 caused investors to lose confidence in all 
structured products and withdraw from that market. This led to a market breakdown 
where tranches of structured products could only be sold at prices well below their 
theoretical values. There was a flight to quality and credit spreads increased. If there had 
been market transparency, so that investors understood the asset-backed securities they 
were buying, there would still have been subprime losses, but the flight to quality and 
disruptions to the market would have been less pronounced. 


652 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


29.4 Lessons for Nonfinancial Corporations 


Here are some lessons applicable primarily to nonfinancial corporations. 


29.4.1 Make Sure You Fully Understand the Trades You Are Doing 


Corporations should never undertake a trade or a trading strategy that they do not fully 
understand. This is a somewhat obvious point, but it is surprising how often a trader 
working for a nonfinancial corporation will, after a big loss, admit to not really under- 
standing what was going on and claim to have been misled by investment bankers. Robert 
Citron, the treasurer of Orange County did this. So did the traders working for Hammer- 
smith and Fulham, who in spite of their huge positions were surprisingly uninformed 
about how the swaps and other interest rate derivatives they traded really worked. 

If a senior manager in a corporation does not understand a trade proposed by a 
subordinate, the trade should not be approved. A simple rule of thumb is that if a trade and 
the rationale for entering into it are so complicated that it cannot be easily understood, it 
is almost certainly inappropriate for the corporation. The trades undertaken by Procter 
& Gamble would have been vetoed under this criterion. 

One way of ensuring that you fully understand a financial instrument is to value it. 
If a corporation does not have the in-house capability to value an instrument, it should 
not trade it. In practice, corporations often rely on their investment bankers for valuation 
advice. This is dangerous, as Procter & Gamble found out. When it wanted to unwind its 
transactions, it found it was facing prices produced by Bankers Trust’s proprietary models, 
which it had no way of checking. 


29.4.2 Make Sure a Hedger Does Not Become a Speculator 


One of the unfortunate facts of life is that hedging is relatively dull, whereas speculation 
is exciting. When a company hires a trader to manage foreign-exchange risk, commodity 
price risk, or interest rate risk, there is a danger that the following happens. At first the 
trader does the job diligently and earns the confidence of top management. He or she 
assesses the company’s exposures and hedges them. As time goes by, the trader becomes 
convinced that he or she can outguess the market. Slowly the trader becomes a speculator. 
At first things go well, but then a loss is made. To recover the loss, the trader doubles down. 
Further losses are made, and so on. The result is a disaster. 

As mentioned earlier, clear limits to the risks that can be taken should be set by senior 
management. Controls should be put in place to ensure that the limits are obeyed. The 
trading strategy for a corporation should start with an analysis of the risks facing the 
corporation in foreign exchange, interest rate, and other markets. A decision should then 
be taken on how the risks are to be reduced to acceptable levels. It is a clear sign that 
something is wrong within a corporation if the trading strategy is not derived in a very 
direct way from the company’s exposures. 
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29.4.3 Be Cautious about Making the Treasury Department a Profit Center 


In the last 30 years, there has been a tendency to make the treasury department within 
a corporation a profit center. This seems to have much to recommend it. The treasurer 
is motivated to reduce financing costs and manage risks as profitably as possible. The 
problem is that the potential for the treasurer to make profits is limited. When raising 
funds and investing surplus cash, the treasurer is facing an efficient market. The treasurer 
can usually improve the bottom line only by taking additional risks. The company’s hedg- 
ing program gives the treasurer some scope for making shrewd decisions that increase 
profits. But it should be remembered that the goal of a hedging program is to reduce risks, 
not to increase expected profits. The decision to hedge will lead to a worse outcome than 
the decision not to hedge roughly 50% of the time. The danger of making the treasury 
department a profit center is that the treasurer is motivated to become a speculator. An 
Orange County or Procter & Gamble type of outcome is then liable to occur. 


29.5 A Final Point 


Most of the risks we have considered in this book are what are termed known risks. 
They are risks such as market risks and credit risks that can be quantified using historical 
data. Two other types of risk are important to financial institutions: unknown risks and 
unknowable risks. 

Unknown risks are risks where the event that could cause a loss is known, but its 
probability of occurrence cannot easily be determined. Operational risks and business 
risks include many different types of unknown risks. What is the probability of a rogue 
trader loss? What is the probability of a loss from a major lawsuit? What is the probability 
that operations in a particular country will be expropriated by the government? These 
probabilities cannot usually be estimated using historical data. As discussed in Chapter 22, 
subjective probabilities are often used. In a widely referenced book, Knight (1921) uses 
the term “risk” to refer to known risks and the term “uncertainty” to refer to unknown 
risks.! 

Unknowable risks are risks where even the event that could cause a loss is not known. 
Unknowable risks are in many ways the most insidious because they come as a complete 
surprise and often lead to dramatic losses. An unknowable risk is sometimes referred to 
as a black swan. (Black swans were not considered possible until they were discovered 
in Australia.) As pointed out by Taleb (2007), once it has occurred, a black swan event 
is often considered to be obvious” Did the producers of multivolume encyclopedias in 
1970 consider the possibility that technological developments would render their product 
worthless? Probably not, but ex post it seems a fairly obvious risk. 


| See F H. Knight, Risk, Uncertainty and Profit (Boston: Houghton Mifflin, 1921). 
?See N. N. Taleb, The Black Swan: The Impact of the Highly Improbable (New York: Random House, 
2007). 
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How can companies manage unknown and unknowable risks? A key tool is 
flexibility. Companies should avoid excessive leverage and try to ensure that their costs 
are variable rather than fixed as far as possible. Diversification across products and 
markets also increases flexibility. In the future, insurance companies may well develop 
more products to handle unknown and unknowable risks. As discussed in Chapter 23, 
products have already been developed to provide protection against some operational 
(unknown) risks. Handling unknowable risk is a challenging (but not totally impossible) 
contract design problem. 
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Part Six 


APPENDICES 


Appendix A 


Compounding Frequencies for 
Interest Rates 


annum sounds straightforward and unambiguous. In fact, its precise meaning 


N statement by a bank that the interest rate on one-year deposits is 10% per 


depends on the way the interest rate is measured. 
If the interest rate is measured with annual compounding, the bank’s statement that 
the interest rate is 10% means that $100 grows to 


$100 x 1.1 = $110 


at the end of one year. When the interest rate is measured with semiannual compounding, 
it means that we earn 5% every six months, with the interest being reinvested. In this 
case, $100 grows to 


$100 x 1.05 x 1.05 = $110.25 


at the end of one year. When the interest rate is measured with quarterly compounding, 
the bank’s statement means that we earn 2.5% every three months, with the interest being 
reinvested. The $100 then grows to 


$100 x 1.0254 = $110.38 


at the end of one year. Table A.1 shows the effect of increasing the compounding fre- 
quency further. 
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Table A.1 Effect of the Compounding Frequency on the 
Value of $100 at the End of One Year When the Interest 
Rate is 10% per Annum 


Compounding Value of $100 at 
Frequency End of Year ($) 
Annually (m = 1) 110.00 
Semiannually (m = 2) 110.25 
Quarterly (m = 4) 110.38 
Monthly (m = 12) 110.47 
Weekly (m = 52) 110.51 
Daily (m = 365) 110.52 


The compounding frequency defines the units in which an interest rate is measured. 
A rate expressed with one compounding frequency can be converted into an equivalent 
rate with a different compounding frequency. For example, from Table A.1 we see that 
10.25% with annual compounding is equivalent to 10% with semiannual compounding. 
We can think of the difference between one compounding frequency and another to be 
analogous to the difference between kilometers and miles. They are two different units 
of measurement. 

To generalize our results, suppose that an amount A is invested for n years at an 
interest rate of R per annum. If the rate is compounded once per annum, the terminal 
value of the investment is 


A(1 + R)" 


If the rate is compounded m times per annum, the terminal value of the investment is 


A (1 + Ry" (A.1) 


m 


When m = 1 the rate is sometimes referred to as the equivalent annual interest rate. 


A.1 Continuous Compounding 


The limit as the compounding frequency, m, tends to infinity is known as continuous 
compounding! With continuous compounding, it can be shown that an amount A invested 
for n years at rate R grows to 


Aec®" (A.2) 


where e = 2.71828. The function e*, which is also written exp(x), is built into most 
calculators, so the computation of the expression in equation (A.2) presents no problems. 


' Actuaries sometimes refer to a continuously compounded rate as the force of interest. 
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In the example in Table A.1, A = 100, n = 1, and R = 0.1, so that the value to which A 
grows with continuous compounding is 


100e°! = $110.52 


This is (to two decimal places) the same as the value with daily compounding. For most 
practical purposes, continuous compounding can be thought of as being equivalent to 
daily compounding. Compounding a sum of money at a continuously compounded rate 


Rn 


R for n years involves multiplying it by e*”. Discounting it at a continuously compounded 


rate R for n years involves multiplying by e~*". 


Suppose that R, is a rate of interest with continuous compounding and R,, is the 
equivalent rate with compounding m times per annum. From the results in equations 


(A.1) and (A.2), we have 
R, mn 
Ae®" = A (: + 2a) 
m 


or 


This means that 


Rm 
R.=mln (1) (A.3) 


m 


and 


R,/n 
R,, = me" — 1) (A.4) 
These equations can be used to convert a rate with a compounding frequency of m 
times per annum to a continuously compounded rate and vice versa. The function In is 
the natural logarithm function and is built into most calculators. It is defined so that if 
y = lnx, then x = æ. 


Example A.1 


Consider an interest rate that is quoted as 10% per annum with semiannual compounding. 
From equation (A.3), with m = 2 and R,, = 0.1, the equivalent rate with continuous 
compounding is 


21In (1 + =) = 0.09758 


or 9.758% per annum. 
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Example A.2 


Suppose that a lender quotes the interest rate on loans as 8% per annum with continuous 
compounding, and that interest is actually paid quarterly. From equation (A.4), with 
m = 4 and R, = 0.08, the equivalent rate with quarterly compounding is 


4(¢9-°8/4 _ 1) = 0.0808 


or 8.08% per annum. This means that on a $1,000 loan, interest payments of $20.20 
would be required each quarter. 


Appendix B 


Zero Rates, Forward Rates, 


and Zero-Coupon 
Yield Curves 


T he n-year zero-coupon interest rate is the rate of interest earned on an invest- 
ment that starts today and lasts for n years. All the interest and principal is real- 
ized at the end of n years. There are no intermediate payments. The n-year 
zero-coupon interest rate is sometimes also referred to as the n-year spot interest rate, the 
n-year zero rate, or just the n-year zero. The zero rate as a function of maturity is referred 
to as the zero curve. Suppose a five-year zero rate with continuous compounding is quoted 
as 5% per annum. (See Appendix A for a discussion of compounding frequencies.) This 


means that $100, if invested for five years, grows to 
100 x e? = 128.40 


A forward rate is the future zero rate implied by today’s zero rates. Consider the zero 
rates shown in Table B.1. The forward rate for the period between six months and one 
year is 6.6%. This is because 5% for the first six months combined with 6.6% for the next 
six months gives an average of 5.8% for the first year. Similarly, the forward rate for the 
period between 12 months and 18 months is 7.6%, because this rate, when combined 
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Table B.1 Zero Rates 


Maturity Zero Rate (%) 

(years) (cont. comp.) 
0.5 5.0 
1.0 5.8 
1.5 6.4 
2.0 6.8 


with 5.8% for the first 12 months, gives an average of 6.4% for the 18 months. In general, 
the forward rate F for the period between times T} and T, is 


pee eee (B.1) 
T,-T; 

where R; is the zero rate for maturity T} and R, is the zero rate for maturity T. This 
formula is exactly true when rates are measured with continuous compounding and 
approximately true for other compounding frequencies. The results from using this for- 
mula on the rates in Table B.1 are in Table B.2. For example, substituting T} = 1.5, 
T, = 2.0, R, = 0.064, and R, = 0.068 gives F = 0.08 showing that the forward rate for 
the period between 18 months and 24 months is 8%. 

Investors who think that future interest rates will be markedly different from forward 
rates have no difficulty in finding trades that reflect their beliefs. Consider an investor who 
can borrow or lend at the rates in Table B.1. Suppose the investor thinks that the six- 
month interest rates will not change much over the next two years. The investor can 
borrow six-month funds and invest for two years. The six-month borrowings can be 
rolled over at the end of 6, 12, and 18 months. If interest rates do stay about the same, 
this strategy will yield a profit of about 1.8% per year because interest will be received 
at 6.8% and paid at 5%. This type of trading strategy is known as a yield curve play. The 
investor is speculating that rates in the future will be quite different from the forward 
rates shown in Table B.2. 

Robert Citron, the treasurer at Orange County, used yield curve plays similar to 
the one we have just described very successfully in 1992 and 1993. The profit from 
Mr. Citron’s trades became an important contributor to Orange County’s budget and 
he was re-elected. In 1994, he used the same strategy more aggressively. If short-term 
interest rates had remained the same or declined, he would have done very well. As it 


Table B.2 Forward Rates for Zero Rates in Table B.1 


Period Forward Rate (%) 
(years) (cont. comp.) 
0.5 to 1.0 6.6 
1.0 to 1.5 7.6 


1.5 to 2.0 8.0 
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happened, interest rates rose sharply during 1994. On December 1, 1994, Orange County 
announced that its investment portfolio had lost $1.5 billion and several days later it filed 
for bankruptcy protection. 


B.1 Bond Pricing 


Most bonds provide coupons periodically. The bond’s principal (which is also known as 
its par value or face value) is received at the end of its life. The theoretical price of a bond 
can be calculated as the present value of all the cash flows that will be received by the 
owner of the bond. The most accurate approach is to use a different zero rate for each 
cash flow. To illustrate this, consider the situation where zero rates are as in Table B.1. 
Suppose that a two-year bond with a principal of $100 provides coupons at the rate of 
6% per annum semiannually. To calculate the present value of the first coupon of $3, we 
discount it at 5% for six months; to calculate the present value of the second coupon of 
$3, we discount it at 5.8% for one year; and so on. The theoretical price of the bond is 
therefore: 


3,e—0-05x0.5 + 3e70058X1.0 + 3 e7 0.064X1.5 4 103 @70:068x2.0 = 98.39 


or $98.39. 


B.2 Bond Yields 


A bond’ yield is the discount rate that, when applied to all cash flows, gives a bond price 
equal to its market price. Suppose that the theoretical price of the bond we have been 
considering, $98.39, is also its market value (i.e., the market’s price of the bond is in 
exact agreement with the data in Table B.1). If y is the yield on the bond, expressed with 
continuous compounding, we must have: 


Je POS 4 Ze VX10 4 3X13 + 103e? = 98.39 


This equation can be solved using Excel’s Solver or in some other way to give y = 6.76%. 


B.3 Treasury Rates 


Treasury rates are the rates an investor earns on Treasury bills and Treasury bonds. These 
are the instruments used by a government to borrow in its own currency. Japanese Trea- 
sury rates are the rates at which the Japanese government borrows in yen; U.S. Treasury 
rates are the rates at which the U.S. government borrows in U.S. dollars; and so on. 


B.4 Determining Treasury Zero Rates 


One way of determining Treasury zero rates such as those in Table B.1 is to observe the 
yields on “strips.” These are zero-coupon bonds that are artificially created by traders 
when they sell the coupons on a Treasury bond separately from the principal. 
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Table B.3 Rates After Two-Year Rate Has Been 
Determined Using the Bootstrap Method 


Maturity Zero Rate (%) 
(years) (cont. comp.) 
0.5 5.00 
1.0 5.80 
1.5 6.40 
2.0 6.80 
2.5 2? 


Another way of determining Treasury zero rates is from regular Treasury bills and 
bonds. The most popular approach is known as the bootstrap method. This involves working 
from short maturities to successively longer maturities matching prices. Suppose that 
Table B.3 gives the Treasury rates determined so far and that a 2.5-year bond providing 
a coupon of 8% sells for $102 per $100 of principal. We would determine the 2.5-year 
zero-coupon interest rate as the rate which, when used in conjunction with the rates in 
Table B.3, gives the correct price for this bond. This involves solving 


= 5 — 5 — 5 — — 2 
4e 0.05x0.5 + 4e 0.058x1.0 + 4e 0.064X1.5 + 4e 0.068X2.0 + 104e RXx2.5 — 102 


which gives R = 7.05%. The zero curve is usually assumed to be linear between the 
points that are determined by the bootstrap method. (In our example, the 2.25-year zero 
rate would be 6.9025%.) It is also assumed to be constant prior to the first point and 
beyond the last point. The zero curve for our example is shown in Figure B.1. 


| Zero rate (%) 


Maturity (years) 


04 T T T T T 1 
0 0.5 1 15 2 2.5 3 


Figure B.1 Zero Curve for Data in Table B.3 
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B.5 OIS Zero Rates 


The OIS zero curve is determined using a similar bootstrap method to that used for 
determining the Treasury zero curve. The OIS rates for maturities out to one year involve 
a single exchange at maturity. They therefore give immediate information about the zero 
rates for those maturities. OIS rates for maturities longer than one year are typically for 
contracts where there are exchanges every three months. They are the yields on bonds 
that sell for par and provide quarterly payments. 

Suppose that the 3-month, 6-month, 9-month, and 12-month OIS zero rates have 
been calculated as 2%, 2.3%, 2.5%, and 2.7%, respectively (with continuous compound- 
ing). Suppose further that the 1.25-year OIS rate is 2.8%. A 1.25-year bond with a 
principal of 100 providing a coupon of 0.7 every three months is worth par (i.e., 100). 
If R is the 1.25-year OIS zero rate, we must have 


0).7e70:02%0.25 + 0.7 e70-023x0.5 + 0.7e70-025x0.75 + 0.7¢7 09271 + 100.7e7 R125 = 100 


This gives R = 2.794%. 


Appendix C 


Valuing Forward and 
Futures Contracts 


he forward or futures price of an investment asset that provides no income is 
given by 
Fo = Sie 
where Sọ is the spot price of the asset today, T is the time to maturity of the forward 
or futures contract, and r is the continuously compounded risk-free rate for maturity T. 
When the asset provides income during the life of the contract that has a present value 
I, this becomes 
Fo = (So — De" 
When it provides a yield at rate q, it becomes 


e 


A foreign currency can be regarded as an investment asset that provides a yield equal to 
the foreign risk-free rate, so that the forward or futures price for a foreign currency is 


Fo = Soet TT 


667 


668 RISK MANAGEMENT AND FINANCIAL INSTITUTIONS 


where r; is the foreign risk-free rate (continuously compounded) and Sọ is the spot 
exchange rate. The value of a forward contract where the holder has the right to buy the 
asset for a price of K is, in all cases, 


(Fy — Kye" 


where F, is the forward price, given by one of the formulas above. The value of a forward 
contract where the holder has the right to sell the asset for a price of K is similarly 


(K = Foje” 


Example C.1 


Consider a six-month futures contract on the S&P 500. The current value of the index 
is 1,200, the six-month risk-free rate is 5% per annum, and the average dividend yield 
on the S&P 500 over the next six months is expected to be 2% per annum (both rates 
continuously compounded). The futures price is 1,200e0-05-0.02)x0.5 or 1,218.14. 


Example C.2 


The current forward price of a commodity for a contract maturing in nine months is 
$550 per ounce. A company has a forward contract to buy 1,000 ounces of the com- 
modity for a delivery price of $530 in nine months. The nine-month risk-free rate 
is 4% per annum continuously compounded. The value of the forward contract is 
1,000 x (550 — 530)e7®04*9/12_ or $19,409. 


In most situations forward and futures prices can be assumed to be the same. An excep- 
tion is provided by contracts on interest rates. The quote for a Eurodollar futures contract 
should not be assumed to be the same as the corresponding forward interest rate, partic- 
ularly when the contract has a long time to maturity. 


Appendix D 


Valuing Swaps 


plain vanilla interest rate swap can be valued by assuming that the interest 
Ak that are realized in the future equal today’s forward interest rates. As an 

example, consider an interest rate swap that has 14 months remaining and a 
notional principal of $100 million. A fixed rate of 5% per annum is received and LIBOR 
is paid, with exchanges taking place every six months. Assume that (a) four months 
ago, the six-month LIBOR rate was 4%, (b) the forward LIBOR interest rate for a 
six-month period starting in two months is 4.6%, and (c) the forward LIBOR for a six- 
month period starting in eight months is 5.2%. All rates are expressed with semiannual 
compounding. Assuming that forward rates are realized, the cash flows on the swap are 
as shown in Table D.1. (For example, in eight months the fixed-rate cash flow received 
is 0.5 X 0.05 x 100, or $2.5 million; the floating-rate cash flow paid is 0.5 X 0.046 x 100, 
or $2.3 million.) The value of the swap is the present value of the net cash flows in the 
final column.' The discount rates used to discount the cash flows are usually OIS zero 
rates. (See Appendix B for their calculation.) 


D.1 LIBOR Forward Rates 


LIBOR forward rates can be calculated using a bootstrap method. Suppose that the 6- 
month, 12-month, 18-month, and 24~month OIS zero rates are 3.8%, 4.3%, 4.6%, and 
4.75% (with continuous compounding). Suppose that in a two-year swap where cash 


1 Note that the precise timing of cash flows depends on day count conventions and holiday calendars. 
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Table D.1 Valuing an Interest Rate Swap by Assuming Forward Rates Are Realized 


Fixed Cash Floating Cash Net Cash 
Time Flow ($ million) Flow ($ million) Flow ($ million) 
2 months 2.5 —2.0 0.5 
8 months 2.5 —2.3 0.2 
14 months 2.5 —2.6 —0.1 


flows are exchanged every six months the swap rate is 5%. Suppose further that the 0.5- 
year LIBOR rate is 4%, the forward rate for the period between 6 months and 12 months 
is 5%, and the forward rate for the period between 12 months and 18 months is 5.5% 
(all semiannually compounded). The value of a two-year swap where LIBOR is received 
and 5% is paid is zero. The value of the first exchange is 


0.5 X (0.04 — 0.05) x 100 x e7®-038x0.5 = —0.4906 
The value of the second exchange is 
0.5 x (0.05 — 0.05) x 100 x e749! = 0 
The value of the third exchange is 
0.5 X (0.055 — 0.05) x 100 x e™®048x1.5 = 0,2333 


The total value of the first three payments is —0.4906 + 0 + 0.2333 = —0.2573 Suppose 
that the (assumed unknown) forward rate for the final payment is F. For the swap to be 
worth zero, we must have 


0.5 x (F — 0.05) x 100 x e~ 478%? = 0.2573 


This gives F = 0.05565 or 5.565%. 


D.2 Currency Swaps 


A currency swap can be valued by assuming that exchange rates in the future equal today’s 
forward exchange rates. As an example, consider a currency swap in which 4% will be 
received in GBP and 6% will be paid in USD once a year. The principals in the two 
currencies are 10 million USD and 5 million GBP. The swap will last for another three 
years. The swap cash flows are shown in the second and third columns of Table D.2. The 
forward exchange rates are (we assume) those shown in the fourth column. These are 
used to convert the GBP cash flows to USD. The final column shows the net cash flows. 
The value of the swap is the present value of these cash flows. 
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Table D.2 Valuing a Currency Swap by Assuming Forward Exchange Rates Are Realized (all cash 
flows in millions) 


USD Cash GBP Cash Forward USD Value of Net Cash 
Time Flow Flow Exchange Rate GBP Cash Flow Flow in USD 
1 —0.6 0.2 1.8000 0.360 —0.240 
2 —0.6 0.2 1.8400 0.368 —0.232 
3 —0.6 0.2 1.8800 0.376 —0.224 
3 —10.0 5.0 1.8800 9.400 —0.600 


An alternative approach (which gives the same valuation) is to regard the swap as a 
long position in a GBP bond and a short position in a USD bond. Each bond can be 
valued in its own currency in the usual way and the current exchange rate can be used 
to convert the value of the GBP bond from GBP to USD. 


Appendix E 


Valuing European Options 


on an investment asset that provides no income are 


T he Black—Scholes—Merton formulas for valuing European call and put options 


c = S,N(d,) — Ke N(d,) 
and 
p = Ke" N(—dy) — SyN(-d,) 


where 
_ In(So/K) + (+. 67/2)T 


oVT 
In (Sy/K) + (r — 67/2)T 

_ In(So/K) + ( /2) =4 eT 

svT 
The function N(x) is the cumulative probability distribution function for a standardized 
normal distribution (see tables at the end of the book or Excel’s NORMSDIST function). 
The variables c and p are the European call and European put price, Sọ is today’s asset 
price, K is the strike price, r is the continuously compounded risk-free rate, 6 is the stock 
price volatility, and T is the time to maturity of the option. 


di 


d, 
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Table E.1 Greek Letters for Options on an Asset That Provides a Yield at Rate q 


Greek Letter Call Option Put Option 
Delta eI N(d,) e-1 (N(d,) — 1] 
N’ (d, )e aT N'(d,)e-47 
Gamma ee oN WAZ 
So yT So VT 
-So N’ (d, J)o" /(2 VT) —SyN'(d,)oe17 / (2 VT) 
Theta (per yr) +qSyN(d, Je" ~qSyN(-d,)et 
—rKe™'T N(d) +rKe—" N(—d>) 
So VT NI! (dye So VT N' (d eT 
Vega (per %) — SEE Po 
100 100 
KTT Nd KTe N(-d 
Rho (per %) oe (6) E Ce) 
100 100 


When the underlying asset provides a cash income, the present value of the income 
during the life of the option should be subtracted from Sọ. When the underlying asset 
provides a yield at rate q, the formulas become: 


c = Soe?’ N(d,) — Ke" N(d,) 
and 
p = Ke N(=dy) — Set" N(-d,) 
where 


_ In(So/K) + 4 - q+.0°/2)T 


d, = 
oT 


esse ce pe 
p= P/K +e- 7T a oyT 


oVT 


Options on a foreign currency can be valued by setting q equal to the foreign risk-free 
rate. Options on a futures or forward price can be valued using these formulas by setting 
Sp equal to the current forward or futures price, r = q, and © equal to the volatility of 
the forward or futures price. 

Table E.1 gives formulas for the Greek letters for European options on an asset that 
provides income at rate q. N’(x) is the standard normal density function: 
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The implied volatility is the volatility, 6, which when substituted into a Black-Scholes- 
Merton pricing formula gives the market price. (See Section 10.2.) When Greek letters 
are calculated, © is set equal to the implied volatility. 


Example E.1 


Consider a six-month European call option on a stock index. The current value of the 
index is 1,200, the strike price is 1,250, the risk-free rate is 5%, the dividend yield on the 
index is 2%, and the index volatility is 20%. In this case, Sy = 1,200, K = 1,250, r = 0.05, 
q = 0.02,6 = 0.2,and T = 0.5. The value of the option is 53.44, the delta of the option 
is 0.45, the gamma is 0.0023, the theta is —0.22, the vega is 3.33, and rho is 2.44. Note 
that the formula in Table E.1 gives theta per year. The theta quoted here is per calendar 
day. 


The calculations in this appendix can be done with the RMFI software, available on 
the author’s website, by selecting Option Type: Black-Scholes European. Option valua- 
tion is described more fully in Hull (2018)! 


! See J.C. Hull, Options, Futures, and Other Derivatives, 10th ed. (Upper Saddle River, NJ: Pearson, 2018). 


Appendix F 


Valuing American Options 


T o value American-style options, we divide the life of the option into n time 
steps of length At. Suppose that the asset price at the beginning of a step is S. 
At the end of the time step it moves up to Su with probability p and down to 
Sd with probability 1 — p. For an investment asset that provides no income the values of 


u, d, and p are given by 


u= ovat gui gat" 
u u—d 
with 
a= d^ 


where r is the risk-free rate and o is the volatility. 

Figure E1 shows the tree constructed for valuing a five-month American put option 
on a non-dividend-paying stock where the initial stock price is 50, the strike price is 50, 
the risk-free rate is 10%, and the volatility is 40%. In this case, there are five steps so that 
At = 0.08333, u = 1.1224, d = 0.8909, a = 1.0084, and p = 0.5073. The upper number 
at each node is the stock price and the lower number is the value of the option. 

At the final nodes of the tree, the option price is its intrinsic value. For example, 
at node G, the option price is 50 — 35.36 = 14.64. At earlier nodes, we first calculate a 
value assuming that the option is held for a further time period of length At and then 
check to see whether early exercise is optimal. Consider first node E. If the option is 
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At each node: 

Upper value = Underlying asset price 
Lower value = Option price 

Shading indicates where option is exercised. 


Strike price = 50 
Discount factor per step = 0.9917 

Time step, dt = 0.0833 years, 30.42 days 
Growth factor per step, a = 1.0084 
Probability of up move, p = 0.5073 
Up-step size, u = 1.1224 
Down-step size, d = 0.8909 


F 


Node time: 
0.0000 0.0833 0.1667 0.2500 0.3333 0.4167 


Figure F1 Binomial Tree for American Put on Non-Dividend-Paying Stock 


held for a further time period, it will be worth 0.00 if there is an up move (probability: 
p) and 5.45 if there is a down move (probability: 1 — p). The expected value in time At is 
therefore 0.5073 X 0 + 0.4927 x 5.45 or 2.686, and the 2.66 value at node E is calculated 
by discounting this at the risk-free rate of 10% for one month. The option should not be 
exercised at node E as the payoff from early exercise would be zero. Consider next node 
A.A calculation similar to that just given shows that, assuming it is held for a further time 
period, the option’s value at node A is 9.90. If exercised, its value is 50 — 39.69 = 10.31. 
In this case, it should be exercised and the value of being at node A is 10.31. 

Continuing to work back from the end of the tree to the beginning, the value of the 
option at the initial node D is found to be 4.49. As the number of steps on the tree is 
increased, the accuracy of the option price increases. With 30, 50, and 100 time steps, we 
get values for the option of 4.263, 4.272, and 4.278, respectively. 

To calculate delta, we consider the two nodes at time Aft. In our example, as we move 
from the lower node to the upper node, the option price changes from 6.96 to 2.16 and 
the stock price changes from 44.55 to 56.12. The estimate of delta is the change in the 
option price divided by the change in the stock price: 


ie ae 
56.12 — 44.55 
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To calculate gamma we consider the three nodes at time 2At. The delta calculated from 
the upper two nodes (C and F) is —0.241. This can be regarded as the delta for a stock 
price of (62.99 + 50)/2 = 56.49. The delta calculated from the lower two nodes (B and 
C) is —0.639. This can be regarded as the delta for a stock price of (50 + 39.69) /2 = 
44.84. The estimate of gamma is the change in delta divided by the change in the stock 
price: 


—0.241 — (—0.639) 
Gamma = ————————_ = 0.034 
56.49 — 44.84 


We estimate theta from nodes D and C as 


3.77 — 4.49 


Theta = —————— 
2 X 0.08333 


or —4.30 per year. This is —0.0118 per calendar day. Vega is estimated by increasing the 
volatility, constructing a new tree, and observing the effect of the increased volatility on 
the option price. Rho is calculated similarly. 

When the asset underlying the option provides a yield at rate q, the procedure is 


exactly the same except that a = e-94" instead of e! 


in the equation for p. (When the 
asset is a foreign currency q is set equal to the foreign risk-free rate.) When the option 
is on the forward or futures price, a is set equal to one and the tree shows the forward 
or futures price at each node. The calculations we have described can be done using the 
RMFI software, available on the author’s website, by selecting Option Type: Binomial 
American. Binomial trees and other numerical procedures are described more fully in 


Hull (2018).! 


"See J.C. Hull, Options, Futures, and Other Derivatives, 10th ed. (Upper Saddle River, NJ: Pearson, 2018). 


Appendix G 


Taylor Series Expansions 


corresponding small change Az in z. A first approximation to the relationship 


between Az and Ax is 


C onsider a function z = F(x). When a small change Ax is made to x, there is a 


Az = Ciy (G.1) 
dx 


This relationship is exact if z is a linear function of x and approximate in other situations. 


A more accurate approximation is 
— (Ax)? (G.2) 
x 


This relationship is exact if z is a quadratic function of x and approximate in other situa- 
tions. By adding more terms in the series, we can increase accuracy. The full expansion is 


where ! denotes the factorial function: 2! =2x*1=2;3!=3x2x1=6;4!=4x3x 
2X1 = 24; and so on. 
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Example G.1 


Consider the function z = yx so that 


dz_ 1 dz 1 Pz 3 


dx 2x!/2 a 427 dx? 8x3/2 


Suppose that x = 2 and Ax = 0.1 so that Az = 2.1 — y2 = 0.034924. When x = 2, 


2 3 
ile = 0.35355 az = —0.08839 E = 0.06629 
dx x2 dx? 


The first order approximation to Az, given by equation (G.1), is 


Az = 0.35355 x 0.1 = 0.035355 


The second order approximation, given by equation (G.2), is 


Az = 0.35355 x 0.1 + 5 x (—0.08839) x 0.17 = 0.034913 


The third order approximation is 
Az = 0.35355 X 0.1 + > x (—0.08839) x 0.17 + L x 0.06629 x 0.1° = 0.034924 


It can be seen that the series expansion quickly converges to the correct answer of 


0.034924. 


G.1 Functions of Two Variables 


Consider next a function of two variables, z = F(x, y). Suppose that Ax and Ay are small 
changes in x and y, respectively, and that Az is the corresponding small change in z. In 
this case, the first order approximation is 


Az = SAx+ Ay (G.3) 
x y 


The second order approximation is 


Oz Oz 10°z 1 
Az = Ax + —Ay + =—>(Ax) + = (Ay)” 
a Ox Oy 2 Ox? a) 2 r. y+ 


Taylor Series Expansions 
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Example G.2 
Consider the function z = ,/xy so that 


Oz yl? Oz xt/2 
Ox 2x1/2 oy 2y!/2 


Oz y” Oz xl? Oz 1 
Ox? 4x3/2 oy? 49/2 ðxðy — 4(xy)!/2 


Suppose that x = 2, y = 1, Ax = 0.1, and Ay = 0.1 so that 


Az = Ņy2.1 X 1.1 — y2 x1 = 0.10565 


When x = 2 and y= 1 


cs = 0.35355 2 = 0.70711 
Ox oy 
2 2 2 
ve = —0.08839 a = —0.35355 = 0.17678 
a Oy? Oxdy 


The first order approximation to Az, given by equation (G.3), is 
Az = 0.35355 x 0.1 + 0.70711 x 0.1 = 0.10607 
The second order approximation, given by equation (G.4), is 
Az = 0.35355 X 0.1 + 0.70711 x 0.1 
+ L x (—0.08839) x (0.1)? + L x (—0.35355) x (0.1)? 
+ 0.17678 x 0.1 X 0.1 = 0.10562 


The series solution is converging to the correct answer of 0.10565. 


G.2 General Result 


For a function z of n variables, x4, X2, ... , X„, the Taylor series expansion of Az is 
m; =0%0 m,=0%0 1 g" 
A. m 
Az= vee A as Ah 
m basi m !9 mı PA 0. mM, 1 n 
m,=0 m,,=0 1° n` Xi Xn 


where m = m; + +++ m, and the term where all m; are zero is zero. 


Appendix H 


Eigenvectors and Eigenvalues 


onsider an n X n matrix A and suppose that x is an n X 1 vector. Consider the 
equation 


Ax = Ax (H.1) 
The equation can be written 
(A-ADx =0 


where I is the n X n identity matrix (which is the n X n matrix with diagonal elements 
equal to 1 and all other elements equal to zero). Clearly, x = 0 is a solution to equation 
(H.1). Under what circumstances are there other solutions? A theorem in linear algebra 
tells us that there are other solutions when the determinant of A — XI is zero. The values 
of A that lead to solutions of equation (H.1) are therefore the values of À that we get 
when we solve the equation that sets the determinant of A — AI equal to zero. This 
equation is an nth order polynomial in À. In general, it has n solutions. The solutions are 
the eigenvalues of the matrix, A. The vector x that solves equation (H.1) for a particular 
eigenvalue is an eigenvector. In general, there are n eigenvectors, one corresponding to 
each eigenvalue. 
As a simple example, suppose that 
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In this case 


The determinant of this matrix is 
(1—A)(4—A) —(-1) K2=A7 — 5046 


The solutions to this equation are A = 3 and A = 2. These are the two eigenvalues of the 
matrix. 
To determine the eigenvectors corresponding to A = 3, we solve equation (H.1): 


1 -1 
€ eai 


Setting 


the equation to be solved becomes 
(2 Eek) 

The simultaneous equations corresponding to this are 

xX — X = 3x, 
and 

2x, + 4x = 3x, 

Both these equations are equivalent to 

X, + 2x, = 0 


It follows that any pair of numbers x, and x, solve the equation when A = 3 providing 
x, = —2x,. By convention, the values of x, and x, that are reported are those for which 
the length of the vector x is 1. This means that x$ + x2 = 1. In this case, the solution where 
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x has a length of 1 is x, = 0.2 = 0.447 and x, = —2 y 0.2 = —0.894. (An alternative 
is x, = —0.447 and x, = 0.894.) The solution 


z= 0.447 
~ \-0.894 
is the eigenvector corresponding to the first eigenvalue (A = 3). 
A similar calculation shows that equation (H.1) is satisfied for A = 2 when x, + x, = 
0. The solution where x has a length of 1 is x, = 0.5 = 0.707 and x, = -V0.5 = 


—0.707. (An alternative is x, = —0.707 and x, = 0.707.) The eigenvector corresponding 
to the second eigenvalue (A = 2) is therefore 


n= 0.707 
~ \=0.707 
For matrices of larger size,a numerical procedure must be used to determine eigenvectors 
and eigenvalues. One such numerical procedure is provided by Press et al. (2007).! 


Applications of eigenvalues and eigenvectors are in Appendices I and J. Software for 
calculating eigenvectors and eigenvalues is on the author’s website. 


| See W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, Numerical Recipes: The Art of 
Scientific Computing, 3rd ed. (Cambridge, UK: Cambridge University Press, 2007). 


Appendix I 


Principal Components Analysis 


principal components analysis is concerned with understanding the structure of 
A data on n correlated variables. The aim of the analysis is to replace the n variables 

by a smaller number of uncorrelated variables. In the example in Section 9.7, 
there are eight variables. These are the daily changes in the 1-year, 2-year, 3-year, 4-year, 
5-year, 7-year, 10-year, and 30-year swap rates. 

The first step in the analysis is to calculate a covariance matrix from the data. As 
explained in Section 11.3, this is an n X n matrix where the (i,j) entry is the covariance 
between variable i and variable j. The entries on the diagonal (where i = /) are variances. 

The next step is to calculate the eigenvalues and eigenvectors for this matrix (see 
Appendix H). The eigenvectors are chosen to have length 1. (As explained in Appendix 
H, this means that the sum of the squares of their elements is 1.) The eigenvector cor- 
responding to the highest eigenvalue is the first principal component; the eigenvector 
corresponding to the second highest eigenvalue is the second principal component; and 
so on. The principal components for the example in Section 9.7 are shown in Table 9.6. 

The eigenvalue for the ith principal component, as a percentage of the sum of all 
the eigenvalues, is the percentage of the overall variance explained by the ith principal 
component. The square root of the ith eigenvalue is the standard deviation of the ith 
factor score (see Table 9.7). 

Software for carrying out a principal components analysis is on the author’s website. 
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Appendix J 


Manipulation of Credit 
Transition Matrices 


uppose that A is an nX n matrix of credit rating changes in one year. This is a 

matrix such as the one shown in Table 21.1. Assuming that rating changes in 

successive time periods are independent, the matrix of credit rating changes in m 
years is A”. If m is an integer, this can be readily calculated using the normal rules for 
matrix multiplication. 

Consider next the problem of calculating the transition matrix for 1/m years where m 
is an integer. (For example, when we are interested in one-month changes, m = 12.) This 
is a more complicated problem because we need to calculate the mth root of a matrix. We 
first calculate eigenvectors x4, X3, ..., X, and the corresponding eigenvalues A4, 2, ..., À 


n 


of the matrix A. These are explained in Appendix H. They have the property that 
Ax; = VX; (J.1) 


Define X as an n X n matrix whose ith column is x; and A as an n X n diagonal matrix 
(i.e., a matrix that has zero values everywhere except on the diagonal) where the ith 
diagonal element is A;. From equation (J.1), we have 


AX =XA 
so that 


A= XAX™! 
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Define A* as a diagonal matrix where the ith diagonal element is aif "Then 


SO I XT!) ... CX Sax 
= XAX!=A 


showing that the mth root of A, and therefore the transition matrix for time 1/m years, 
is XA*X7!, 

Some authors, such as Jarrow, Lando, and Turnbull (1997), prefer to handle this prob- 
lem in terms of what is termed a generator matrix! This is a matrix T such that the tran- 
sition matrix for a short period of time, Af, is I +TAt and the transition matrix for a 
longer period of time, t, is 


where I is the identity matrix (with ones on the diagonal and zeros elsewhere). 
Software for manipulating credit transition matrices is on the author’s website. 


"See R. A. Jarrow, D. Lando, and S. M. Turnbull, “A Markov Model for the Term Structure of Credit 
Spreads,” Review of Financial Studies 10 (1997): 481-523. 


Appendix K 


Valuation of Credit 
Default Swaps 


( ) redit default swaps (CDSs) are described in Chapter 19. They can be valued 
using (risk-neutral) default probability estimates. 

Suppose that the hazard rate of a reference entity is 2% per annum for five 
years. Table K.1 shows survival probabilities and unconditional default probabilities (that 
is, default probabilities as seen at time zero) for each of the five years. From equation 
(19.2), the probability of survival to time t is e~°-°*'. The probability of default during a 
year is the probability of survival to the beginning of the year minus the probability of 
survival to the end of the year. For example, the probability of survival to time 2 years 
is e7002 = 0,9608 and the probability of survival to time 3 years is e~?-°?*? = 0.9418. 
The probability of default during the third year is 0.9608 — 0.9418 = 0.0190. 

We will assume that defaults always happen halfway through a year and that payments 
on a five-year credit default swap are made once a year, at the end of each year. We also 
assume that the risk-free interest rate is 5% per annum with continuous compounding 
and the recovery rate is 40%. There are three parts to the calculation. These are shown 
in Tables K.2, K.3, and K.4. 

Table K.2 shows the calculation of the present value of the expected payments 
made on the CDS assuming that payments are made at the rate of s per year and the 
notional principal is $1. For example, there is a 0.9418 probability that the third pay- 
ment of s is made. The expected payment is therefore 0.9418s and its present value is 
0.941 8se~0-*3 = 0.8106s. The total present value of the expected payments in Table K.2 
is 4.0728s. 
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Table K.1 Unconditional Default Probabilities and Survival Probabilities 


Time (years) Probability of Surviving to Year End Probability of Default During Year 
1 0.9802 0.0198 
2 0.9608 0.0194 
3 0.9418 0.0190 
4 0.9321 0.0186 
5 0.9048 0.0183 


Table K.2 Calculation of the Present Value of Expected Payments (payment = s per annum) 


Time Probability Expected Discount PV of Expected 
(years) of Survival Payment Factor Payment 
1 0.9802 0.9802s 0.9512 0.93245 
2 0.9608 0.9608s 0.9048 0.8694s 
3 0.9418 0.9418s 0.8607 0.8106s 
4 0.9231 0.92315 0.8187 0.7558s 
5 0.9048 0.90485 0.7788 0.7047s 
Total 4.0728 


Table K.3 Calculation of the Present Value of Expected Payoff (notional principal = $1) 


Time Probability Recovery Expected Discount PV of Expected 
(years) of Default Rate Payoff ($) Factor Payoff ($) 
0.5 0.0198 0.4 0.0119 0.9753 0.0116 

1.5 0.0194 0.4 0.0116 0.9277 0.0108 

2.5 0.0190 0.4 0.0114 0.8825 0.0101 

3.5 0.0186 0.4 0.0112 0.8395 0.0094 

4.5 0.0183 0.4 0.0110 0.7985 0.0088 
Total 0.0506 


Table K.4 Calculation of the Present Value of Accrual Payment 


Time Probability Expected Discount PV of Expected 
(years) of Default Accrual Payment Factor Accrual Payment 
0.5 0.0198 0.0099s 0.9753 0.00975 
1.5 0.0194 0.0097s 0.9277 0.0090s 
2.5 0.0190 0.0095s 0.8825 0.00845 
3:5 0.0186 0.0093s 0.8395 0.0078s 
4.5 0.0183 0.0091s 0.7985 0.0073s 


Total 0.0422s 
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Table K.3 shows the calculation of the present value of the expected payoff assuming 
a notional principal of $1. As mentioned earlier, we are assuming that defaults always 
happen halfway through a year. For example, there is a 0.0190 probability of a payoff 
halfway through the third year. Given that the recovery rate is 40%, the expected payoff 
at this time is 0.0190 x 0.6 x 1 = 0.0114. The present value of the expected payoff is 
0.0114e~°-*2> = 0.0101. The total present value of the expected payoffs in Table K.3 
is $0.0506. 

As a final step, we evaluate in Table K.4 the accrual payment made in the event of a 
default. These accrual payments arise because the spread payments s are made in arrears so 
that, when a default occurs,a portion ofa spread payment is owed. Consider the third year. 
There is a 0.0190 probability of a default halfway through the year. An accrual payment of 
0.5s is due if there is a default. The expected accrual payment halfway through the third 
year is therefore 0.0190 x 0.5s = 0.0095s. Its present value is 0.0095se~°*?> = 0.0084s. 
The total present value of the expected accrual payments is 0.0422s. 

From Tables K.2 and K.4, the present value of the expected payments is 


4.07285 + 0.04225 = 4.1150s 


From Table K.3, the present value of the expected payoff is 0.0506. Equating the two, 
the CDS spread for a new CDS is given by 


4.1150s = 0.0506 


or s = 0.0123. The mid-market spread should be 0.0123 times the principal or 123 basis 
points per year. (This is roughly what we would expect from the relationship in equation 
(19.3): the recovery rate is 40% and the hazard rate is 2% so that equation (19.3) predicts 
a spread of 0.02 X 0.6 = 0.012.) 

This example is designed to illustrate the calculation methodology. In practice, we 
are likely to find that the calculations are more extensive than those in Table K.2 to K.4 
because (a) payments are often made more frequently than once a year and (b) we want 
to assume that defaults can happen more frequently than once a year. 


K.1 Marking to Market a CDS 


At the time it is negotiated, a CDS, like most other swaps, is worth close to zero. At later 
times it may have a positive or negative value. Suppose, for example, the credit default 
swap in our example had been negotiated some time ago for a spread of 150 basis points, 
the present value of the payments by the buyer would be 4.1150 x 0.0150 = 0.0617 and 
the present value of the payoff would be 0.0506, as given by Table K.3. The value of 
the swap to the seller would therefore be 0.0617 — 0.0506 or 0.0111 times the principal. 
Similarly, the mark-to-market value of the swap to the buyer of protection would be 
—0.0111 times the principal. 

The RMFI software that accompanies this book and can be downloaded from the 
author’s website includes a worksheet for carrying out the above calculations. 


Appendix L 


Synthetic CDOs and 
Their Valuation 


ynthetic collateralized debt obligations (CDOs) consist of tranches where one 
S party (Party A) agrees to make payments to another party (Party B) that are equal 
to those losses on a specified portfolio of debt instruments that are in a certain 
range. In return, Party B agrees to make payments to Party A that are a certain proportion 
of the amount of principal that is being insured. 
Suppose that the range of losses for a particular tranche is from Qz to y. The variables 
a, and Qy are known as the attachment point and detachment point, respectively. If a, is 
8% and Qy is 18%, Party A pays to Party B the losses on the portfolio, as they are incurred, 
in the range 8% to 18% of the total principal of the portfolio. The first 8% of losses on 
the portfolio does not therefore affect the tranche. The tranche is responsible for the next 
10% of losses and its notional principal (initially 18 — 8 = 10% of the portfolio principal) 
reduces as these losses are incurred. The tranche is wiped out when losses exceed 18%. 
The payments that are made by Party B to Party A are made periodically at a specified 
rate applied to the remaining notional tranche principal. This specified rate is known as 
the tranche spread. 
The usual assumption is that all the debt instruments in the portfolio have the same 
probability distribution for the time to default. Define Q(f) as the probability of a debt 


instrument defaulting by time t. The one-factor Gaussian copula model of time to default 
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presented in Section 11.6 has become the standard market model for valuing a tranche 
of a collateralized debt obligation (CDO). From equation (11.13) 


N~"[Q()] - (PF 
QUE = N3 — (L.1) 
Vi=p 


where Q(t|F) is the probability of the ith entity defaulting by time t conditional on the 
value of the factor, F. In the calculation of Q(f) it is usually assumed that the hazard rate 
for a company is constant. When a CDS spread or other credit spread is available, it can 
be used to determine the hazard rate using calculations similar to those in Appendix K 
in conjunction with a search procedure. 

Suppose that the hazard rate is A. Then: 


Q(t) =1-e™ (L.2) 


From the properties of the binomial distribution, the probability of exactly k defaults by 
time f, conditional on F is 


! 
Plk, t|F) = —“— au PT — QUIE" L.3 
(k, t|F) m-pa CI Q(t|F)] (L.3) 
Define 
He ee A O 


where R is the recovery rate (assumed constant). Also, define m(x) as the smallest integer 
greater than x. The tranche suffers no losses when the number of defaults, k, is less than 
m(n,). It is wiped out when k is greater than or equal to m(ny). Otherwise the tranche 
principal at time t is a proportion 


Oy, — k(1 — R)/n 
Op — Oy 


of the initial tranche principal. These results can be used in conjunction with equations 
(L.1), (L.2), and (L.3) to calculate the expected tranche principal at all times condi- 
tional on F. We can then integrate over F to find the (unconditional) expected tranche 
principal. This integration is usually accomplished with a procedure known as Gaus- 
sian quadrature. (The author’s website provides the tools for integrating over a normal 
distribution using Gaussian quadrature.) 

It is usually assumed that defaults happen at the midpoint of the intervals between 
payments. Similarly to Appendix K, we are interested in the following quantities: 


1. The present value of the expected spread payments received by Party A, 
2. The present value of the expected payments for tranche losses made by Party A, and 
3. The present value of accrual payments received by Party A. 
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The spread payments received by Party A at a particular time are linearly dependent on 
the tranche principal at that time. The tranche loss payments made by Party A (assumed 
to be at the midpoint of an interval) is the change in the principal during the interval. The 
accrual payment received by Party A is proportional to the tranche loss payments. For any 
assumption about spreads, all three quantities of interest can therefore be calculated from 
the expected tranche principal on payment dates. The break-even spread can therefore 
be calculated analogously to the way it is calculated for CDSs in Appendix K. 

Derivatives dealers calculate an implied copula correlation, p, from the spreads quoted 
in the market for tranches of CDOs and tend to quote these rather than the spreads them- 
selves. This is similar to the practice in options markets of quoting Black—Scholes—Merton 
implied volatilities rather than dollar prices. Two correlations are used. The compound 
(tranche) correlation for a tranche is the correlation consistent with the tranche’s market 
value. The base correlation for x% is the correlation consistent with the market value of 
a tranche with attachment point at zero and detachment point at x%. There is a correla- 
tion smile phenomenon in CDO markets similar to the volatility smile phenomenon in 
options markets. 

The RMFI software that accompanies this book and can be downloaded from the 


author’s website includes a worksheet for carrying out the above calculations.! 


'More details on the calculation can be found in J. C. Hull, Options, Futures, and Other Derivatives, 10th 
ed. (Upper Saddle River, NJ: Pearson, 2018). 


Answers to Questions 
and Problems 


Chapter 1 


1.1 The expected return is 12.5%. The standard deviation of returns is 17.07%. 
1.2 From equations (1.1) and (1.2), expected return is 12.5%. SD of return is 


0.52 X 0.17072 + 0.5 X 0.17077 + 2 x 0.15 x 0.52 x 0.17077 = 0.1294 


or 12.94%. 
1.3 m Wy Hp Sp (P = 0.3) Sp (p = 1) Op (P = —1) 

0.0 1.0 15% 24.00% 24.00% 24.00% 
0.2 0.8 14% 20.39% 22.40% 16.00% 
0.4 0.6 13% 17.42% 20.80% 8.00% 
0.6 0.4 12% 15.48% 19.20% 0.00% 
0.8 0.2 11% 14.96% 17.60% 8.00% 
1.0 0.0 10% 16.00% 16.00% 16.00% 


1.4 Nonsystematic risk can be diversified; systematic risk cannot.Systematic risk is more 
important to an equity investor. Either type of risk can lead to the bankruptcy of a 
corporation. 

1.5 We assume that investors trade off mean return and standard deviation of return. For 
a given mean return, they want to minimize standard deviation of returns. All make 
the same estimates of means, standard deviations, and coefficients of correlation for 
returns on individual investments. Furthermore, they can borrow or lend at the 
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risk-free rate. The result is that they all want to be on the “new efficient frontier” 
in Figure 1.4. They choose the same portfolio of risky investments combined with 
borrowing or lending at the risk-free rate. 

1.6 (a) 7.2%, (b) 9%, (c) 14.4%. 

1.7 The capital asset pricing model assumes that there is one factor driving returns. 
Arbitrage pricing theory assumes multiple factors. 

1.8 In many jurisdictions, interest on debt is deductible to the corporation whereas 
dividends are not deductible. It can therefore be more tax-efficient for a company 
to fund itself with debt. However, as debt increases, the probability of bankruptcy 
increases. 

1.9 Risk decomposition refers to a procedure where risks are handled one by one. 
Risk aggregation refers to a procedure where a portfolio of risks is considered. 
Risk decomposition requires an in-depth understanding of individual risks. Risk 
aggregation requires an understanding of the correlations between risks. 

1.10 When potential losses are large, we cannot aggregate them and assume they will be 
diversified away. It is necessary to consider them one by one and handle them with 
insurance contracts, tighter internal controls, and the like. 

1.11 This is the probability that profit is no worse than —4% of assets. This profit level is 
4.6/1.5 = 3.067 standard deviations from the mean. The probability that the bank 
will have a positive equity is therefore N(3.067) where N is the cumulative normal 
distribution function. This is 99.89%. 

1.12 Banks are allowed to accept deposits from the general public. Companies in retailing 
and manufacturing are not. 

1.13 Professional fees ($10 million per month), lost sales (people are reluctant to do 
business with a company that is being reorganized), and loss of key senior executives 
(lack of continuity). 

1.14 The return earned by the hedge fund manager with zero alpha would be 0.05 + 
0.6 X (0.10 — 0.05) = 0.08 or 8%. Because the alpha equals 4%, the hedge fund 
manager’s return was 8% plus 4%, or 12%. 


Chapter 2 


2.1 The banking system became more concentrated, with large banks having a bigger 
share of the market. The total number of banks decreased from 14,483 to 5,060. 

2.2 In the early twentieth century, many states passed laws restricting banks from open- 
ing more than one branch. The McFadden Act of 1927 restricted banks from open- 
ing branches in more than one state. 

2.3 The main risk is that interest rates will rise so that, when deposits are rolled over, 
the bank has to pay a higher rate of interest. The result will be a reduction in the 
bank’s net interest income. There is also liquidity risk. 

2.4 DLC’s loss is more than its equity capital, and it would probably be liquidated. 
The subordinated long-term debt holders would incur losses on their $5 million 
investment. The depositors should get their money back. 
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2.5 The net interest income of a bank is interest recetved minus interest paid. 

2.6 Credit risk primarily affects loan losses. Non-interest income includes trading gains 
and losses. Market risk therefore affects non-interest income. It also affects net inter- 
est income if assets and liabilities are not matched. Operational risk primarily affects 
non-interest expense. 

2.7 A private placement is a new issue of securities that is sold to a small number of 
large institutional investors. A public offering is a new issue of securities that is 
offered to the general public. In a best efforts deal, the investment bank does as well 
as it can to place securities with investors, but does not guarantee that they can be 
sold. In a firm commitment deal, the investment bank agrees to buy the securities 
from the issuing company for a particular price and attempts to sell them in the 
market for a higher price. 

2.8 The bidders when ranked from the highest price bid to the lowest are: H, C, FE A, B, 
D, E, and G. Bidders H, C, and F have bid for 140,000 shares. A has bid for 20,000. 
The price that clears the market is the price that was bid by A, or $100. H, C, and 
F get their orders filled at this price. Half of A’s order is filled at this price. 

2.9 A Dutch auction potentially attracts a wide range of bidders. If all interested market 
participants bid, the price paid should be close to the market price immediately after 
the IPO. The usual IPO situation where the price turns out to be well below the 
market price should therefore be avoided. Also, investment banks are not able to 
restrict purchasers to their best current and potential clients. The Google IPO was 
different from a standard Dutch auction in that Google reserved the right to choose 
the number of shares that would be issued, and the percentage allocated to each 
bidder, when it saw the bids. 

2.10 Poison pills can give management a negotiation tool, particularly if the board has 
the right to overturn a poison pill or make it ineffective. When it is confronted 
with a potential acquirer, the poison pill can buy the company time to bargain for 
a better purchase price or find other bidders. However, there is the danger that the 
poison pill will discourage potential buyers from approaching the company in the 
first place. 

2.11 The brokerage subsidiary of a bank might recommend securities that the invest- 
ment banking subsidiary is trying to sell. The commercial banking subsidiary might 
pass confidential information about its clients to the investment banking subsidiary. 
When a bank does business with a company (or wants to do business with the 
company), it might persuade the brokerage subsidiary to recommend the company’s 
shares as a “buy.” The commercial banking subsidiary might persuade a company to 
which it has lent money to do a bond issue because it is worried about its exposure 
to the client. (It wants the investment banking subsidiary to persuade its clients to 
take on the credit risk.) These conflicts of interest are handled by what are known 
as Chinese walls. They prevent the flow of information from one part of the bank 
to another. 

2.12 The trading book includes the instruments that a bank has as a result of its trading 
activities. The banking book consists of instruments it does not expect to trade, 
such as loans to corporations and individuals. 
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2.13 As a result of IFRS 9 and similar accounting standards from FASB, banks have to 
record loans they have made in their financial statements at the amount owed less 
an estimate of expected losses. 

2.14 In the originate-to-distribute model, a bank originates loans and then securitizes 
them so that they are passed on to investors. This was done extensively with house- 
hold mortgages during the seven-year period leading up to July 2007. In July 2007, 
investors lost confidence in the securitized products, and banks were forced to aban- 
don the originate-to-distribute model, at least temporarily. 


Chapter 3 


3.1 Term life insurance lasts a fixed period (e.g., five years or ten years). The policy- 
holder pays premiums. If the policyholder dies during the life of the policy, the 
policyholder’s beneficiaries receive a payout equal to the principal amount of the 
policy. Whole life insurance lasts for the whole life of the policyholder. The poli- 
cyholder pays premiums (usually the same each year), and the policyholder’s ben- 
eficiaries receive a payout equal to the principal amount of the policy when the 
policyholder dies. There is an investment element to whole life insurance because 
the premiums in early years are high relative to the expected payout in those years. 
(The reverse is true in later years.) 

3.2 Variable life insurance is whole life insurance where the policyholder can specify 
how the funds generated in early years (the excess of the premiums over the actu- 
arial cost of the insurance) are invested. There is a minimum payout on death, but 
the payout can be more than the minimum if the investments do well. Universal 
life insurance is whole life insurance where the premium can be reduced to a spec- 
ified minimum level without the policy lapsing. The insurance company chooses 
the investments (generally fixed income) and guarantees a minimum return. If the 
investments do well, the return provided on the policyholder’s death may be greater 
than the guaranteed minimum. 

3.3 Annuity contracts have exposure to longevity risk. Life insurance contracts have 
exposure to mortality risk. 

3.4 The lifetime annuity created from an accumulated value was calculated using an 
interest rate that was the greater of (a) the market interest rate and (b) a prespecified 
minimum interest rate. 

3.5 The probability that the woman will die during the first year is 0.003182. The 
probability that the woman will die during the second year is 0.003473 x (1 — 
0.003182) = 0.003462. Suppose that the break-even premium is X. We must have 


1,000,000 x (0.003182 + 0.003462) = X + (1 — 0.003182).X 


so that X = 3,327. The break-even premium is therefore $3,327. 
3.6 The probability of a male surviving to 30 is 0.97519. The probability of a male 
surviving to 90 is 0.17735. The probability ofa male surviving to 90 conditional that 
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30 is reached is therefore 0.17735/0.97519 = 0.18186. The probability of a female 
surviving to 90 conditional that 30 is reached is 0.29104/0.98635 = 0.29507. 

3.7 The biggest risks are those arising from catastrophes such as earthquakes and hur- 
ricanes and those arising from liability insurance (e.g., claims related to asbestos in 
the United States). This is because there is no “law of large numbers” working in 
the insurance company’s favor. Either the event happens and there are big payouts 
or the event does not happen and there are no payouts. 

3.8 CAT bonds (catastrophe bonds) are an alternative to reinsurance for an insurance 
company that has taken on a certain catastrophic risk (e.g., the risk of a hurricane 
or an earthquake) and wants to get rid of it. CAT bonds are issued by the insurance 
company. They provide a higher rate of interest than risk-free bonds. However, 
the bondholders agree to forgo interest, and possibly principal, to meet any claims 
against the insurance company that are within a prespecified range. 

3.9 The CAT bond has very little systematic risk. Whether a particular type of catas- 
trophe occurs is independent of the return on the market. The risks in the CAT 
bond are to some extent diversified away by the other investments in the portfolio. 
A B-rated bond does have systematic risk so that less of its risks are diversified away. 
It is likely therefore that the CAT bond is a better addition to the portfolio. 

3.10 In Canada and the United Kingdom, health care is provided by the government. 
In the United States, publicly funded health care is limited and most individuals 
buy private health care insurance of one sort or another. In the United Kingdom, 
a private health care system operates alongside the public system. 

3.11 Both moral hazard and adverse selection are potential problems. The insurance 
might lead to an individual not trying to keep a job as much as he or she otherwise 
would. Indeed, an individual might purposely lose his or her job to collect the 
insurance payout! Also, individuals who are most at risk for losing their jobs would 
be the ones who would choose to buy the insurance. 

3.12 The payouts of property—casualty insurers show more variability than the payouts 
of life insurers. This is because of the possibility of catastrophes such as earthquakes 
and hurricanes and liability insurance claims such as those related to asbestos in the 
United States. 

3.13 The loss ratio is the ratio of payouts to premiums in a year. The expense ratio is 
the ratio of expenses (e.g., sales commissions and expenses incurred in validating 
losses) to premiums in a year. The statement is not true because investment income 
can be significant. Premiums are received at the beginning of a year, and payouts 
are made during the year or after the end of the year. 

3.14 A defined contribution plan is a plan where the contributions of each employee 
(together with contributions made by the employer for that employee) are kept in 
a separate account and invested for the employee. When retirement age is reached, 
the accumulated amount is usually converted into an annuity. In a defined benefit 
plan, all contributions for all employees are pooled and invested. Employees receive 
a predefined pension that is based on their years of employment and final salary. At 
any given time, a defined benefit plan may be in surplus or in deficit. 
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3.15 The employee’s wages are constant in real terms. Suppose that they are X per year. 
(The units for X do not matter for the purposes of our calculation.) The pension 
is 0.75X. The real return earned is zero. Because employees work for 40 years, the 
present value of the contributions made by one employee is 40XR where R is the 
contribution rate as a percentage of the employee’s wages. The present value of the 
benefits is 20 x 0.75X = 15X. The value of R that is necessary to adequately fund 
the plan must therefore satisfy 


40XR = 15X 


The solution to this equation is R = 0.375. The total of the employee contributions 
(and employer contributions, if any) should therefore be 37.5% of salary. 


Chapter 4 


4.1 The number of shares of an open-end mutual fund increases as investments in the 
fund increase and decreases as investors withdraw their funds. A closed-end fund is 
like any other corporation with a fixed number of shares that trade. 

4.2 The net asset value (NAV) of an open-end mutual fund is calculated at 4 p.m. each 
day as the value of the assets held by the fund divided by the number of shares 
outstanding. 

4.3 The investor is deemed to have made capital gains of $300 and $100 in 2018 and 
2019, respectively. In 2020, the investor is deemed to have made a capital loss of 
$200. 

4.4 An index fund is a fund that is designed so that its value tracks the performance 
of an index such as the S&P 500. It can be created by buying all the stocks (or 
a representative subset of the stocks) that underlie the index. Sometimes futures 
contracts on the index are used. 

4.5 The front-end load is the amount an investor pays, as a percentage of his or her 
investment, when shares of the fund are purchased. The back-end load is the amount 
an investor pays, as a percentage of his or her investment, when shares of the fund 
are redeemed. 

4.6 An exchange-traded fund (ETF) that tracks an index is created when an institutional 
investor deposits a portfolio of shares that is designed to track the index and receives 
shares in the ETE Institutional investors can at any time exchange shares in the ETF 
for the underlying shares held by the ETE or vice versa. The advantages over an 
open-end mutual fund that tracks the index are that the fund can be traded at any 
time, the fund can be shorted, and the fund does not have to be partially liquidated 
to accommodate redemptions. The advantage over a closed-end mutual fund is that 
there is very little difference between the ETF share price and the net asset value 
per share of the fund. 

4.7 The arithmetic mean ofa set of n numbers is the sum of the numbers divided by n. 
The geometric mean is the nth root of the product of the numbers. The arithmetic 
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mean is always greater than or equal to the geometric mean. The return per year 
realized when an investment is held for several years is calculated using a geometric 
mean, not an arithmetric mean. (The procedure is to calculate the geometric mean 
of one plus the return in each year and then subtract one.) 

4.8 Late trading is the illegal practice of putting in an order to buy or sell an open- 
end mutual fund at the 4 p.m. price after 4 P.M. Market timing is a practice where 
favored clients are allowed to buy and sell a mutual fund frequently to take advantage 
of the fact that some prices used in the calculation of the 4 P.M. net asset value 
are stale. Front running is the practice of trading by individuals ahead of a large 
institutional trade that is expected to move the market. Directed brokerage describes 
the situation where a mutual fund uses a brokerage house for trades when the 
brokerage house recommends the fund to clients. 

4.9 Mutual funds must disclose their investment policies; their use of leverage is limited; 
they must calculate NAV daily; their shares must be redeemable at any time. 

4.10 Ifa hedge fund is making money out of trading convertible bonds, it must be doing 
so at the expense of its counterparties. If most of the traders are hedge funds, they 
cannot all be making money. 

4.11 Hurdle rate is the minimum return necessary for an incentive fee to be applicable. 
High-water mark refers to the previous losses that must be recouped before incen- 
tive fees are applicable. Clawback refers to investors being able to use some of the 
past incentive fees they have paid as an offset to current losses. 

4.12 If the return is X(> 2%), the investors pay 0.02 + 0.2(X — 0.02) in fees. It must 
therefore be the case that 


X — 0.02 — 0.2(X — 0.02) = 0.2 


so that 0.8X = 0.216 or X = 0.27. A return of 27% is necessary. 

4.13 Short-term gains and losses do matter if the hedge fund is highly leveraged. Short- 
term losses can lead to margin calls that can destroy the hedge fund. 

4.14 The leverage a hedge fund 1s allowed to take is limited by its prime broker. This in 
turn influences the risks that the hedge fund can take. 


Chapter 5 


5.1 When a trader enters into a long forward contract, she is agreeing to buy the under- 
lying asset for a certain price at a certain time in the future. When a trader enters 
into a short forward contract, she is agreeing to sell the underlying asset for a certain 
price at a certain time in the future. 

5.2 A trader is hedging when she has an exposure to the price of an asset and takes a 
position in a derivative to offset the exposure. In a speculation, the trader has no 
exposure to offset. She is betting on the future movements in the price of the asset. 
Arbitrage involves taking a position in two or more different markets to lock in a 
profit. 
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5.3 In the first case, the trader is obligated to buy the asset for $50. (The trader does 
not have a choice.) In the second case, the trader has an option to buy the asset for 
$50. (The trader does not have to exercise the option.) 

5.4 Selling a call option involves giving someone else the right to buy an asset from 
you for a certain price. Buying a put option gives you the right to sell the asset to 
someone else. 

5.5 (a) The investor is obligated to sell pounds for 1.3000 when they are worth 1.2900. 

The gain is (1.3000 — 1.2900) x 100,000 = $1,000. 
(b) The investor is obligated to sell pounds for 1.3000 when they are worth 1.3200. 
The loss is (1.3200 — 1.3000) x 100,000 = $2,000. 
5.6 (a) The trader sells for 50 cents per pound something that is worth 48.20 cents per 
pound. Gain = ($0.5000 — $0.4820) x 50,000 = $900. 
(b) The trader sells for 50 cents per pound something that is worth 51.30 cents per 
pound. Loss = ($0.5130 — $0.5000) x 50,000 = $650. 

5.7 You have sold a put option. You have agreed to buy 100 shares for $40 per share if 
the party on the other side of the contract chooses to exercise the right to sell for 
this price. The option will be exercised only when the price of the stock is below 
$40. Suppose, for example, that the option is exercised when the price is $30. You 
have to buy at $40 shares that are worth $30; you lose $10 per share, or $1,000 in 
total. If the option is exercised when the price is $20, you lose $20 per share, or 
$2,000 in total. The worst that can happen is that the price of the stock declines 
to almost zero during the three-month period. This highly unlikely event would 
cost you $4,000. In return for the possible future losses, you receive the price of the 
option from the purchaser. 

5.8 The over-the-counter (OTC) market is a market where financial institutions, fund 
managers, and corporate treasurers get in touch with each other directly and can 
enter into relatively large mutually acceptable transactions. An exchange-traded 
market is a market organized by an exchange where traders either meet physically or 
communicate electronically and the contracts that can be traded have been defined 
by the exchange. (a) OTC, (b) exchange, (c) both, (d) OTC, (e) OTC. 

5.9 One strategy would be to buy 200 shares. Another would be to buy 2,000 options. 
If the share price does well, the second strategy will give rise to greater gains. For 
example, if the share price goes up to $40, you gain [2,000 x ($40 — $30)] — $5,800 
= $14,200 from the second strategy and only 200 x ( $40 — $29) = $2,200 from 
the first strategy. However, if the share price does badly, the second strategy gives 
greater losses. For example, if the share price goes down to $25, the first strategy 
leads to a loss of 200 X ( $29 — $25) = $800, whereas the second strategy leads to 
a loss of the whole $5,800 investment. This example shows that options contain 
built-in leverage. 

5.10 You could buy 5,000 put options (or 50 contracts) with a strike price of $25 and 
an expiration date in four months. This provides a type of insurance. If, at the end 
of four months, the stock price proves to be less than $25, you can exercise the 
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options and sell the shares for $25 each. The cost of this strategy is the price you 
pay for the put options. 

5.11 A stock option provides no funds for the company. It is a security sold by one trader 
to another. The company is not involved. By contrast, a stock when it is first issued 
is a claim sold by the company to investors and does provide funds for the company. 

5.12 Ignoring the time value of money, the holder of the option will make a profit if 
the stock price in March is greater than $52.50. This is because the payoff to the 
holder of the option is, in these circumstances, greater than the $2.50 paid for the 
option. The option will be exercised if the stock price at maturity is greater than 
$50.00. Note that, if the stock price is between $50.00 and $52.50, and the option 
is exercised, the holder of the option takes a loss overall. 

5.13 Ignoring the time value of money, the seller of the option will make a profit if the 
stock price in June is greater than $56.00. This is because the cost to the seller of 
the option is in these circumstances less than the price received for the option. The 
option will be exercised if the stock price at maturity is less than $60.00. Note that 
if the stock price is between $56.00 and $60.00, the seller of the option makes a 
profit even though the option is exercised. 

5.14 A long position in a four-month put option can provide insurance against the 
exchange rate falling below the strike price. It ensures that the foreign currency 
can be sold for at least the strike price. 

5.15 The company could enter into a long forward contract to buy 1 million Canadian 
dollars in six months. This would have the effect of locking in an exchange rate 
equal to the current forward exchange rate. Alternatively, the company could buy 
a call option giving it the right (but not the obligation) to purchase 1 million 
Canadian dollars at a certain exchange rate in six months. This would provide 
insurance against a strong Canadian dollar in six months while still allowing the 
company to benefit from a weak Canadian dollar at that time. 

5.16 The payoff from an ICON is the payoff from: 

(a) A regular bond 

(b) A short position in call options to buy 169,000 yen with an exercise price of 
1/169 

(c) A long position in call options to buy 169,000 yen with an exercise price of 
1/84.5 
This is demonstrated by the following table: 


Terminal Terminal Terminal Terminal 
Value of Value of Value of Value of 
Regular Bond Short Calls Long Calls Whole Position 
Sr > 169 1,000 0 0 1,000 
84.5 < Sp < 169 1,000 ~169,000 (+ — 5) 0 2,000 — 162000 
Sp 169 Sr 


Sr < 84.5 1,000 —169,000 (+ = 5) 169,000 (+ = =) 0 
Sr 169 are 
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5.17 (a) The trader buys a 180-day call option and takes a short position in a 180-day 
forward contract. 
(b) The trader buys 90-day put options and takes a long position in a 90-day for- 
ward contract. 

5.18 It enters into a five-year swap where it pays 3.30% and receives LIBOR. Its invest- 
ment is then at LIBOR minus 0.30%. 

5.19 It enters into a five-year swap where it receives 3.26% and pays LIBOR. Its net cost 
of borrowing is LIBOR plus 1.74%. 

5.20 It enters into a three-year swap where it receives LIBOR and pays 3%. Its net 
borrowing cost for the three years is then 4% per annum. 

5.21 Suppose that the weather is bad and the farmer’s production is lower than expected. 
Other farmers are likely to have been affected similarly. Corn production overall will 
be low and as a consequence the price of corn will be relatively high. The farmer is 
likely to be overhedged relative to actual production. The farmer’s problems arising 
from the bad harvest will be made worse by losses on the short futures position. This 
problem emphasizes the importance of looking at the big picture when hedging. 
The farmer is correct to question whether hedging price risk while ignoring other 
risks is a good strategy. 

5.22 It may well be true that there is just as much chance that the price of oil in the 
future will be above the futures price as that it will be below the futures price. This 
means that the use of a futures contract for speculation would be like betting on 
whether a coin comes up heads or tails. But it might make sense for the airline 
to use futures for hedging rather than speculation. The futures contract then has 
the effect of reducing risks. It can be argued that an airline should not expose its 
shareholders to risks associated with the future price of oil when there are contracts 
available to hedge the risks. 

5.23 Microsoft is choosing an option on a portfolio of assets instead of the correspond- 
ing portfolio of options. The former is always less expensive because there is the 
potential for an increase in the price of one asset to be netted off against a decrease 
in the price of another asset. Compare (a) an option with a strike price of $20 on 
a portfolio of two assets each worth $10 and (b) a portfolio of two options with 
a strike price of $10, one on each of the assets. If both assets increase in price or 
both assets decrease in price, the payoffs are the same. But if one decreases and the 
other increases, the payoff from (a) is less than that from (b). Both the Asian feature 
and the basket feature in Microsoft’s options help to reduce the cost of the options 
because of the possibility of gains and losses being netted. 

5.24 It means that the price of the energy source can go up or down but will tend over 
time to get pulled back to its long-run average level. Electricity has the highest rate 
of mean reversion; oil has the lowest. 

5.25 As we increase the frequency with which the asset price is observed, the asset price 
becomes more likely to hit the barrier and the value of a knock-out call goes down. 

5.26 The average of the highest and lowest temperature each day is 75° Fahrenheit. 
The CDD each day is therefore 10 and the cumulative CDD for the month is 
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10 x 31 = 310. The payoff from the call option is therefore (310 — 250) x 5,000 = 
$300,000. 

5.27 A 5X8 contract for May 2019 is a contract to provide electricity for five days 
per week during the off-peak period (11 p.m. to 7 A.M.). When daily exercise is 
specified, the holder of the option is able to choose each weekday whether he or 
she will buy electricity at the strike price. When there is monthly exercise, he or 
she chooses once at the beginning of the month whether electricity is to be bought 
at the strike price for the whole month. The option with daily exercise is worth 
more. 

5.28 Two calculations are necessary to determine the initial margin. The first gives 


500 x (3.5 + 0.2 x 57 — 3) = 5,950 
The second gives 
500 x (3.5 + 0.1 X 57) = 4,600 


The initial margin is the greater of these, or $5,950. Part of this can be provided by 
the initial amount of $500 x 3.5 = $1,750 received for the options. 

5.29 The cost of the shares is $500 x 50 or $25,000. When shares are shorted, the 
proceeds of the sale form part of the margin. In this case, the total margin required 
is 1.6 X $25,000, or $40,000. The extra margin required is therefore $15,000. This 
can be in the form of cash or marginable securities. When the share price rises to 
S, the value of the underlying position is 500S. There is a margin call when 


40,000 < 1.3 x 500S or S > 61.54. 


5.30 A broker sets an initial margin and a maintenance margin. When the balance in 
the margin account falls below the maintenance margin level, the broker’s client 
is required to bring it back up to the initial margin level. The exchange clearing 
house, when dealing with its members, sets the initial margin and maintenance 
margin to be the same. Every day the balance in the margin account must be at this 
level or more. 


Chapter 6 


6.1 Mortgages were frequently securitized. The only information that was retained 
during the securitization process was the applicant’s FICO score and the loan-to- 
value ratio of the mortgage. The originators knew this and so this was the only 
information that they cared about. 

6.2 There was a short-term imbalance between supply and demand because many peo- 
ple were persuaded to take out mortgages they could not afford. 
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6.3 When the loss rate on the mortgages is 5%, there are no losses on the mezzanine 
tranche of the ABS and therefore there are no losses on any of the tranches of the 
ABS CDO. When the loss rate on the mortgages is 12%, the loss on the mezzanine 
tranche of the ABS is 7/20 or 35%. The loss rate on the equity and mezzanine 
tranches of the ABS CDO is 100%. The loss rate on the senior tranche of the ABS 
CDO is 10/75 = 13.333%. 

6.4 Often a tranche is thin and the probability distribution of the loss is quite different 
from that on the bond. If a loss occurs, there is a high probability that it will be 
100%. A 100% loss is much less likely for a bond. 

6.5 An ABS is a set of tranches created from a portfolio of loans, bonds, credit card 
receivables, and so on. An ABS CDO is an ABS created from particular tranches 
(e.g., the BBB-rated tranches) of a number of different ABSs. 

6.6 Investors underestimated how high the default correlations between mortgages 
would be in stressed market conditions. Investors also did not always realize that the 
tranches underlying ABS CDOs were usually quite thin so that they were either 
totally wiped out or untouched. There was an unfortunate tendency to assume that 
a tranche with a particular rating could be considered to be the same as a bond with 
that rating. This assumption was not valid for the reasons just mentioned. 

6.7 Agency costs is a term used to describe the costs in a situation where the interests of 
two parties are not perfectly aligned. The incentives of traders, originators, valuers, 
the creators of the structured products, and the rating agencies arguably created 
agency costs. 

6.8 The waterfall defines how the interest and principal cash flows from the underlying 
portfolio are distributed to the tranches. In a typical arrangement, interest cash flows 
are first used to pay the most senior tranche its promised return on its outstanding 
principal. The cash flows (if any) that are left over are used to provide the next 
most senior tranche with its promised return on its outstanding principal, and so 
on. Principal cash flows are usually used first to repay the most senior tranche, then 
the next most senior tranche, and so on. The equity tranche receives principal and 
interest payments only when more senior tranches have been paid. 

6.9 An ABS CDO is created from tranches of ABSs (e.g., BBB tranches). A motivation 
on the part of the originator was that it could be difficult to find investors for the 
tranches directly. 

6.10 Mian and Sufi showed that regions where mortgage application denials were highest 
in the United States in 1996 were also regions where mortgage origination grew 
particularly fast during the 2000 to 2006 period. 

6.11 The mezzanine tranche of an ABS or ABS CDO is a tranche that is in the middle 
as far as seniority goes. It ranks below the senior tranches and therefore absorbs 
losses before they do. It ranks above the equity tranche (so that the equity tranche 
absorbs losses before it does). 

6.12 As default correlation increases, the senior tranche of an ABS becomes more risky 
because it is more likely to suffer losses. The equity tranche becomes less risky as the 
default correlation increases. To understand this, note that in the limit when there 
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is perfect correlation (and assuming no recoveries), all tranches have the same loss 
experience because either (a) all companies default or (b) no companies default. As 
we move toward the perfect correlation situation, the tranches become more similar 
so that the senior tranche becomes more risky and the equity tranche becomes less 
risky. 

6.13 The end-of-year bonus reflects performance during the year. Traders and other 
employees of banks are liable to be always focusing on their next end-of-year bonus 
(and not on possible later effects of their decisions) and therefore have a short-term 
time horizon for their decision making. A trend toward deferred bonuses that can 
be clawed back is changing this to some extent. 


Chapter 7 
7.1 From equation (7.3), the probability is N(d,) where 


_ In(50/70) + (0.12 — 0.27/2) x 2 
0.2/2 


The probability is therefore N(—0.4825) = 0.315. 
7.2 From equation (7.4), this is 


d, = —0.4825 


50 x exp[(0.12 — 0.22/2) x 2 — N7! (0.05) x 0.2 x V2] = 97.25 


or $97.25. 

7.3 Risk-neutral valuation states that, if we value a derivative on the assumption that 
investors are risk-neutral (i.e., investors do not require a higher expected return for 
bearing risks), we get the right answer for all worlds (including the real world). 

7.4 The expected value is higher in the real world. The higher value reflects the com- 
pensation for risk required by the investor. 

7.5 By buying both derivatives, one is certain to earn $2 on an initial investment of 
$98. The risk-free rate is therefore 2/98 = 2.04% with annual compounding. The 
risk-neutral probability of default is t where 1007/(1 + 0.0204) = 3 or m = 3.0612%. 

7.6 The risk-neutral probability of the payoff is given by equation (7.3) with V = 30, 
Sp = 25, 0 = 0.3, p = 0.03, and T = 0.25. It is 


(25/30) + (0.03 — 0.3? /2) x 0.25 
0.3 x 4y/0.25 


= 0.1074 


The value of the option is therefore 100 x 0.1074 x e~°-9%95 = $10.66. For the 
real-world probability of a payoff, we set u = 0.1 instead of 0.03 in the equation to 
get 0.1305. 
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7.7 The real world is used to simulate possible values of the underlying variables in one 
year. The risk-neutral world is then used to calculate the value of the portfolio at 
the one-year point for each simulation trial. 

7.8 Mean reversion is the term used to describe the tendency for the values of variables 
such as interest rates and commodity prices to be pulled back to a central value. Of 
course, an uncertain random movement is superimposed on this pull-back. 

7.9 Girsanov’s theorem tells us that when we move from the real world to the risk- 
neutral world or vice versa, the expected growth rates of market variables change 
but their volatilities remain the same. 


Chapter 8 


8.1 The value of the portfolio decreases by $10,500. 

8.2 The value of the portfolio increases by $400. 

8.3 In both cases, it increases by 0.5 X 30 X 2? or $60. 

8.4 A delta of 0.7 means that, when the price of the stock increases by a small amount, 
the price of the option increases by 70% of this amount. Similarly, when the price 
of the stock decreases by a small amount, the price of the option decreases by 70% 
of this amount. A short position in 1,000 options has a delta of —700 and can be 
made delta neutral with the purchase of 700 shares. 

8.5 A theta of —100 per day means that if one day passes with no change in either 
the stock price or its volatility, the value of the option position declines by $100. 
If a trader feels that neither the stock price nor its implied volatility will change, 
she should write an option with as big a negative theta as possible so that her 
position has a high positive theta. Relatively short-life, at-the-money options have 
the biggest negative theta. 

8.6 The gamma of an option position is the rate of change of the delta of the position 
with respect to the asset price. For example, a gamma of 0.1 would indicate that, 
when the asset price increases by a certain small amount, delta increases by 0.1 of 
this amount. When the gamma of an option writer’s position is large and negative 
and the delta is zero, the option writer will lose significant amounts of money if 
there is a large movement (either an increase or a decrease) in the asset price. 

8.7 To hedge an option position, it is necessary to create the opposite option position 
synthetically. For example, to hedge a long position in a put, it is necessary to create 
a short position in a put synthetically. It follows that the procedure for creating an 
option position synthetically is the reverse of the procedure for hedging the option 
position. 

8.8 A long position in either a put or a call option has a positive gamma. From Figure 
8.9, when gamma is positive, the hedger gains from a large change in the stock 
price and loses from a small change in the stock price. Hence the hedger will fare 
better in case (b). When the portfolio contains short option positions, the hedger 
will fare better in case (a). 
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8.9 The delta indicates that when the value of the euro exchange rate increases by 
$0.01, the value of the bank’s position increases by 0.01 X 30,000 = $300. The 
gamma indicates that when the euro exchange rate increases by $0.01, the delta of 
the portfolio decreases by 0.01 X 80,000 = 800. For delta neutrality, 30,000 euros 
should be shorted. When the exchange rate moves up to 0.93, we expect the delta 
of the portfolio to decrease by (0.93 — 0.90) x 80,000 = 2,400 so that it becomes 
27,600. To maintain delta neutrality, it is therefore necessary for the bank to unwind 
its short position by 2,400 euros so that a net 27,600 euros have been shorted. When 
a portfolio is delta neutral and has a negative gamma, a loss is experienced when 
there is a large movement in the underlying asset price. We can conclude that the 
bank is likely to have lost money. 

8.10 When used in the way described in the text, it does assume volatility is constant. 
In theory, we could implement a static options replication strategy where there are 
three dimensions: time, the stock price, and volatility. Prices are then matched on a 
surface in the three-dimensional space. 

8.11 Ten regular options are likely to be needed. This is because there are 10 equations 
to be satisfied, one for each point on the boundary. 

8.12 The payoff from an Asian option becomes more certain with the passage of time. 
As a result, the amount of uncertainty that needs to be hedged decreases with the 
passage of time. 

8.13 Consider a portfolio of options dependent on a single market variable. A single 
trade is all that is necessary to make the position delta neutral regardless of the 
number of options in the portfolio. 

8.14 The price, delta, gamma, vega, theta, and rho are 0.0217, —0.396, 5.415, 0.00203, 
—0.0000625, and —0.00119. Delta predicts that the option price should decrease by 
approximately 0.000396 when the exchange rate increases by 0.001. This is what 
we find. When the exchange rate is increased to 0.751, the option price decreases 
to 0.0213. 


Chapter 9 


9.1 In this case, the interest rate mismatch is $10 billion. The bank’s net interest income 
declines by $100 million each year for the next three years. 

9.2 If long-term rates were simply a reflection of expected future short-term rates, we 
would expect long rates to be less than short rates as often as they are greater than 
short rates. (This is based on the assumption that half of the time investors expect 
rates to increase, and half of the time investors expect rates to decrease.) Liquidity 
preference theory argues that long-term rates are high relative to expected future 
short-term rates. This means that long rates are greater than short rates most of the 
time. When long rates are less than short rates, the market is expecting a relatively 
steep decline in rates. 
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9.3 There are two reasons: 

1. The amount of capital a bank is required to hold to support an investment in 
Treasury bills and bonds (typically zero) is substantially smaller than the capital 
required to support a similar investment in other very-low-risk instruments. 

2. In the United States, Treasury instruments are given a favorable tax treatment 
compared with most other fixed-income investments because they are not taxed 
at the state level. 

9.4 In an overnight indexed swap, the geometric average of the federal funds rate for a 
period such as three months is exchanged for a prespecified fixed rate. 

9.5 The LIBOR-OIS spread is a measure of the reluctance of banks to lend to each 
other. 

9.6 Duration provides information about the effect of a small parallel shift in the yield 
curve on the value of a bond portfolio. The percentage decrease in the value of the 
portfolio equals the duration of the portfolio multiplied by the amount by which 
interest rates are increased in the small parallel shift. Its main limitation is that it 
applies only to parallel shifts in the yield curve that are small. 

9.7 (a) The bond’s price is 86.80, (b) the bond’s duration is 4.256 years, (c) the duration 
formula shows that when the yield decreases by 0.2% the bond’s price increases by 
0.74, and (d) recomputing the bond’s price with a yield of 10.8% gives a price of 
87.54, which is approximately consistent with (a) and (c). 

9.8 (a) The bond’ price is 88.91, (b) the bond’s modified duration is 3.843 years, (c) the 
duration formula shows that when the yield decreases by 0.2% the bond’s price 
increases by 0.68, and (d) recomputing the bond’s price with a yield of 10.8% 
(annually compounded) gives a price of 89.60, which is approximately consistent 
with (a) and (c). 

9.9 The bond’s price is 104.80. The duration of the bond is 5.35. The convexity is 
30.60. The eftect of a 1% increase in the yield is estimated by equation (9.4) as 


104.80 x (—0.01 Xx 5.35 + 0.5 X 30.60 x 0.0001) = —5.44 


The bond’s price actually changes to 99.36, which is consistent with the estimate. 

9.10 We can (a) perturb points on the yield curve (see Figure 9.5), (b) perturb sections 
of the yield curve (see Figure 9.7), and (c) perturb the market quotes used to create 
the yield curve. 

9.11 The deltas (changes in portfolio value per unit of factor with factor loading being 
assumed to be in basis points) are —5.64 and 225.63. 

9.12 The impact on the portfolio, measured as a proportion of the value of the portfolio, 
is 


—(0.2 x 0.001 + 0.6 x 0.0008 + 0.9 x 0.0007 + 1.6 x 0.0006 
+2.0 x 0.0005 — 2.1 x 0.0003 — 3.0 x 0.0001) = —0.00234 


The portfolio decreases by 0.234%. 
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9.13 Dollar duration is defined as the product of the duration of a portfolio and its value. 
Dollar convexity is defined as the product of the convexity of a portfolio and its 
value. 

9.14 The partial durations add up to the total duration. The DV01 is the total duration 
times the portfolio value times 0.0001. 


Chapter 10 


10.1 2 X V3, or 3.46%. 

10.2 The standard deviation of the percentage price change in one day is 25/ 252 or 
1.57%, and 95% confidence limits are from —3.09% to +3.09%. 

10.3 Volatility is much higher when markets are open than when they are closed. 
Traders therefore measure time in trading days rather than calendar days. 

10.4 Implied volatility is the volatility that leads to the option price equaling the mar- 
ket price when Black—Scholes—Merton assumptions are used. It is found by trial 
and error. Because different options have different implied volatilities, traders are 
not using the same assumptions as Black—Scholes—Merton. (See Chapter 25 for a 
further discussion of this.) 

10.5 The standard formula for calculating standard deviation gives 0.547% per day. The 
simplified approach in equation (10.4) gives 0.530% per day. 

10.6 The power law gives 0.01 = K x 5007? so that K = 2,500. (a) 2,500 x 1,0007? = 
0.0025 or 0.25%, and (b) 2,500 x 2,10007? = 0.000625 or 0.0625%. 

10.7 The variance rate estimate calculated at the end of day n — 1 for day n equals À 
times the variance rate calculated at the end of day n — 2 for day n — 1 plus 1 — À 
times the squared return on day n — 1. This is equivalent to assigning weights to 
squared returns that decline exponentially as we move back through time. 

10.8 GARCH(1,1) adapts the EWMA model by giving some weight to a long-run 
average variance rate. Whereas the EWMA has no mean reversion, GARCH(1,1) 
is consistent with a mean-reverting variance rate model. 

10.9 In this case, 6,_; = 0.015 and u,_, = 0.5/30 = 0.01667, so that equation (10.8) 
gives 


0? = 0.94 x 0.0157 + 0.06 x 0.016677 = 0.0002281 


The volatility estimate on day n is therefore \/0.0002281 = 0.015103 or 1.5103%. 
10.10 Reducing A from 0.95 to 0.85 means that more weight is given to recent obser- 
vations of u? and less weight is given to older observations. Volatilities calculated 
with à = 0.85 will react more quickly to new information and will bounce around 


much more than volatilities calculated with A = 0.95. 
10.11 With the usual notation u,_, = 20/1,040 = 0.01923 so that 


o? = 0.000002 + 0.06 x 0.019237 + 0.92 x 0.017 = 0.0001162 


This gives 6,, = 0.01078. The new volatility estimate is therefore 1.078% per day. 
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10.12 The proportional daily change is —0.005/1.5000 = —0.003333. The current daily 
variance estimate is 0.0067 = 0.000036. The new daily variance estimate is 


0.9 X 0.000036 + 0.1 x 0.003333? = 0.000033511 


The new volatility is the square root of this. It is 0.00579 or 0.579%. 

10.13 The weight given to the long-run average variance rate is 1 — a — B and the long- 
run average variance rate is @/(1 — a — p). Increasing œ increases the long-run 
average variance rate. Increasing increases the weight given to the most recent 
data item, reduces the weight given to the long-run average variance rate, and 
increases the level of the long-run average variance rate. Increasing B increases 
the weight given to the previous variance estimate, reduces the weight given to 
the long-run average variance rate, and increases the level of the long-run average 
variance rate. 

10.14 The long-run average variance rate is w@/(1-—a-—f) or 0.000004/0.03 = 
0.0001333. The long-run average volatility is y 0.0001333 or 1.155%. The equa- 


tion describing the way the variance rate reverts to its long-run average is 
2 J= a 
Elev g V; + (a + P) (o; v) 
In this case, 


E|o?,] = 0.0001333 + 0.97" (6? — 0.0001333) 


If the current volatility is 20% per year, 6,, = 0.2/y 252 = 0.0126. The expected 
variance rate in 20 days is 


0.0001333 + 0.97" (0.0126? — 0.0001333) = 0.0001471 


The expected volatility in 20 days is therefore y 0.0001471 = 0.0121 or 1.21% 
per day. 

10.15 The FTSE expressed in dollars is XY where X is the FTSE expressed in sterling 
and Y is the exchange rate (value of one pound in dollars). Define x; as the pro- 
portional change in X on day i and y, as the proportional change in Y on day i. 
The proportional change in XY is approximately x, + y;. The standard deviation 
of x; is 0.018 and the standard deviation of y, is 0.009. The correlation between 
the two is 0.4. The variance of x; + y; is therefore 


0.018? + 0.0097 + 2 x 0.018 x 0.009 x 0.4 = 0.0005346 


so that the volatility of x; + y; is 0.0231 or 2.31%. This is the volatility of the FTSE 
expressed in dollars. Note that it is greater than the volatility of the FTSE expressed 
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in sterling. This is the impact of the positive correlation. When the FTSE increases, 
the value of sterling measured in dollars also tends to increase. This creates an even 
bigger increase in the value of FTSE measured in dollars. Similarly, when FTSE 
decreases, the value of sterling measured in dollars also tends to decrease, creating 
an even bigger decrease in the value of FTSE measured in dollars. 

10.16 In this case, V, = 0.000003/0.02 = 0.00015 and equation (10.14) gives the 
expected variance rate in 30 days as 


0.00015 + 0.98°°(0.017 — 0.00015) = 0.000123 


The volatility is y 0.000123 = 0.0111 or 1.11% per day. 
10.17 In this case, V, = 0.000002/0.02 = 0.0001. In equation (10.15), V, = 0.0001, 
a = 0.0202, T = 20, and V (0) = 0.000169 so that the volatility is 19.88%. 


Chapter 11 


11.1 You need the standard deviations of the two variables. 

11.2 Loosely speaking, correlation measures the extent of linear dependence. It does 
not measure other types of dependence. When y = x”, there is clearly dependence 
between x and y. However, E(xy) = E(x°). Both E(x) and E(x°) are zero when «x is 
normal (or when it has any symmetrical distribution centered at zero). It follows 
that the coefficient of correlation between x and y is zero. 

11.3 In a factor model, the correlation between two variables arises entirely because of 
their correlation with other variables known as factors. A factor model reduces 
the number of estimates that have to be made when correlations between large 
numbers of variables are being produced. 

11.4 A positive-semidefinite matrix is a matrix that satisfies equation (11.4) for all vec- 
tors w. If a correlation matrix is not positive-semidefinite, the correlations are 
internally inconsistent. 

11.5 (a) The volatilities and correlation imply that the current estimate of the covari- 

ance is 0.25 X 0.016x0.025 = 0.0001. 
(b) Ifthe prices of the assets at close of trading are $20.50 and $40.50, the propor- 
tional changes are 0.5/20 = 0.025 and 0.5/40 = 0.0125. The new covariance 


estimate 1s 
0.95 X 0.0001 + 0.05 x 0.025 x 0.0125 = 0.0001106 
The new variance estimate for asset A is 
0.95 x 0.0167 + 0.05 x 0.025? = 0.00027445 
so that the new volatility is 0.0166. The new variance estimate for asset B is 


0.95 x 0.0257 + 0.05 x 0.01257 = 0.000601562 
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so that the new volatility is 0.0245. The new correlation estimate is 


0.0001 106 


—_——_—_——— = 0.272 
0.0166 X 0.0245 


11.6 The most recent returns for X and Y are 1/30 = 0.03333 and 1/50 = 0.02, 
respectively. The previous covariance is 0.01 x 0.012 x 0.50 = 0.00006. The new 
estimate of the covariance is 


0.000001 + 0.04 x 0.03333 x 0.02 + 0.94 x 0.00006 = 0.0000841 
The new estimate of the variance of X is 
0.000003 + 0.04 x 0.033337 + 0.94 x 0.017 = 0.0001414 


so that the new volatility of X is y 00.0001414 = 0.01189 or 1.189%. The new 
estimate of the variance of Y is 


0.000003 + 0.04 x 0.02? + 0.94 x 0.0127 = 0.0001544 


so that the new volatility of Y is /0.0001544 = 0.01242 or 1.242%. The 
new estimate of the correlation between the assets is therefore 0.0000841/ 
(0.01189 x 0.01242) = 0.569. 

11.7 Continuing with the notation in the answer to Problem 10.15, define z; as the 
proportional change in the value of the S&P 500 on day i. The covariance between 
x; and z; is 


0.7 x 0.018 x 0.016 = 0.0002016 
The covariance between y; and z; is 


0.3 X 0.009 X 0.016 = 0.0000432 


The covariance between x; + y; and z; equals the covariance between x; and z; 
plus the covariance between y; and z;. It is 


0.0002016 + 0.0000432 = 0.0002448 


The volatility of x; + y; is 2.31% from Problem 10.15. The correlation between 
x; + y; and z; is 


0.0002448 


—————. = 0.662 
0.016 X 0.0231 


Note that the volatility of the S&P 500 drops out in this calculation. 
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11.8 V, 
V; 0.25 0.5 0.75 
0.25 0.095 0.163 0.216 
0.50 0.163 0.298 0.413 


0.75 0.216 0.413 0.595 


11.9 The formulas are 


= — 2 =e 
& = 21, € = P221 + 204/1- Pi,» €, = 0,2, 0,2, + 0323 


where 


/ 2 2 2 
OO, = Piz > Q1 P12 + OX 1 — pi, = Pog » aj +a, +a = 1 


This means that 


aj = P13 > pa A e. 03 = y1- -o 
2 
yiTPR 


11.10 Tail dependence is the tendency for extreme values for two or more variables to 
occur together. The choice of the copula affects tail dependence. For example, the 
Student’s t-copula gives more tail dependence than the Gaussian copula. 

11.11 Sample from a bivariate Student’s t distribution as in Figure 11.5. Convert each 
sample to a normal distribution on a percentile-to-percentile basis. 

11.12 The probability that V} < 0.1 is 0.05. The conditional probability that V, < 0.1 
is 0.006/0.05 = 0.12. The conditional probability that V, < 0.2 is 0.017/0.05 = 
0.34, and so on. 

11.13 When V, = 0.2, U, = —0.8416. From the properties of the bivariate normal 
distribution, the median of U, is —0.5 X 0.8416 = —0.4208. The percentile of 
the conditional median is therefore N(—0.4208) = 0.3369. This translates into 
a median value for V3 of x where 2x? = 0.3369. The conditional median value 
for V, is therefore 0.4105. 

11.14 In this case, 


zi =i 
OEN (= (0.015) + VO.2N c=) er 


1-0.2 


The loss rate that we are 99.5% certain will not be exceeded is 12.7%. 
11.15 The maximum likelihood estimates for the probability of default and the copula 
correlation are 3.92% and 11.18%, respectively. 
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Chapter 12 


12.1 VaR is the loss that is not expected to be exceeded with a certain confidence 
level. Expected shortfall is the expected loss conditional that the loss is worse 
than the VaR level. Expected shortfall has the advantage that it always satisfies the 
subadditivity (diversification is good) condition. 

12.2 A spectral risk measure is a risk measure that assigns weights to the quantiles of 
the loss distribution. For the subadditivity condition to be satisfied, the weight 
assigned to the qth quantile must be a non-decreasing function of q. 

12.3 There is a 5% chance that you will lose $6,000 or more during a one-month 
period. 

12.4 Your expected loss during a bad month is $6,000. Bad months are defined as the 
months where returns are less than the five-percentile point on the distribution 
of monthly returns. 

12.5 (a) $1 million. 

(b) The expected shortfall is 0.9 x 10 + 0.1 X 1 or $9.1 million. 

(c) There is a probability of 0.009? = 0.000081 ofa loss of $20 million, a probabil- 
ity of 2 X 0.009 X 0.991 = 0.017838 of a loss of $11 million, and a probability 
of 0.9917 = 0.982081 of a loss of $2 million. The VaR when the confidence 
level is 99% is therefore $11 million. 

(d) The expected shortfall is (0.000081 x 20 + 0.009919 x 11)/0.01 = $11.07 
million. 

(e) Because 1 +1 < 11, the subadditivity condition is not satisfied for VaR. 
Because 9.1 + 9.1 > 11.07, it is satisfied for expected shortfall. 

12.6 (a) 2 x 1.96 = $3.92 million, (b) /5 x 2 x 1.96 = $8.77 million, (c) /5 x 2x 
2.33 = 10.40 million. 

12.7 (b) becomes $9.96 million and (c) becomes $11.82 million. 

12.8 Marginal VaR is the rate of change of VaR with the amount invested in the ith 
asset. Incremental VaR is the incremental effect of the ith asset on VaR (i.e., the 
difference between VaR with and without the asset). Component VaR is the part 
of VaR that can be attributed the ith asset (the sum of component VaRs equals 
the total VaR). 

12.9 The probability of 17 or more exceptions is 1-BINOMDIST(16,1000,0.01, 
TRUE) or 2.64%. The model should be rejected at the 5% confidence level. 

12.10 Bunching is the tendency for exceptions to be clustered together rather than 
occurring randomly throughout the time period considered. 

12.11 We are interested in the variance of AP, + AP, +... + AP. This is ae o? + 
2 pa P00, where ©; is the standard deviation of AP; and p; is the correlation 
between AP, and AP. In this case, 6; = © for all i and Pj = p= when i > j. After 
further algebraic manipulations, this leads directly to equation (12.5). 

12.12 The standard deviation in three months is 24/3 = $3.464 million. Also, 
N7!(0.98) = 2.054. The 98% three-month VaR is therefore 3.464 x 2.054 = 
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$7.11 million. The 98% three-month ES is 3.464 x exp(—2.0547/2)/(1/ 21 x 
0.02) = $8.39 million. 


Chapter 13 


13.1 The assumption is that the statistical process driving changes in market variables 
over the next day is the same as that over the past 500 days. 
13.2 


, A1 
THA -N0 AA = Ã—_ 
( MK ) 1AA +... +A! 
This shows that as À approaches 1, the weights approach 1/n. 
13.3 The standard error of the estimate is 


1 0.05 x 0.95 _ 0.69 
0.01 1,000 


The standard error of the VaR estimate is $0.69 million. 

13.4 (a) The 95% one-day VaR is the 25th worst loss. This is $156,511. (b) The 95% 
one-day ES is the average of the 24 highest losses. It is $209,310. (c) The 97% 
one-day VaR is the 15th worst loss. This is $172,224. (d) The 97% one-day ES is 
the average of the 14 highest losses. It is $240,874. 

13.5 In worksheet 2 (Scenarios), the portfolio investments are changed to 2,500 in cells 
L2:O2. The losses are then sorted from the largest to the smallest. The fifth worst 
loss is $238,526. This is the one-day 99% VaR. The average of the four worst 
losses is $372,872. This is the one-day 99% ES. 

13.6 The value of A in cell F2 of worksheet 4 (Scenarios with Weights) is changed 
from 0.995 to 0.99. The losses are then sorted from the largest to the smallest. 
The cumulative weight for the largest loss ($477,841, Scenario 494) is 0.00948. 
The cumulative weight for the second largest loss ($345,435, Scenario 339) is 
0.01147. The one-day 99% VaR is therefore the second worst loss or $345,435. 
The one-day ES is 0.948 X 477,841 + 0.052 x 345,435 or $470,917. 

13.7 The value of A in cell K1 of worksheet 9 (Losses Adjusted for Loss SD) is changed 
from 0.94 to 0.96. The adjusted losses are then sorted from the highest to the 
lowest. The fifth worst loss is $541,337. This is the new one-day 99% VaR. The 
average of the four highest losses is $707,146. This is the new one-day 99% ES. 

13.8 This is 


= -1/0.436 
22 iG aag= 
500 32.532 


or 0.001623. 
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13.9 The VaR ($000s) is 


—0.436 
iene { [2a = 0.97) = i} = 173.6 


0.436 22 


or $173,600. 

13.10 The maximum likelihood estimates of E and P become 0.353 and 34.05. The one- 
day 99% VaR becomes $230,725 and the one-day 99.9% VaR becomes $452,831. 
The one-day 99% ES becomes $327,336 and the one-day 99.9% ES becomes 
$670,499. 

13.11 The ranked losses from the volatility scaling procedure must be transferred to 
worksheet 11 (Extreme Val Theory). With u = 400, we have n, = 17 and the max- 
imum likelihood values of € and B are 0.438 and 82.838. The one-day 99% VaR 
is $534,100. The one-day 99.9% VaR is $1,096,661. The one-day ES with 99% 
confidence is $785,819. The one-day ES with 99.9% confidence is $1,786,335. 
The probability of a loss greater than $600,000 is 


= —1/0.438 
A gag aay = 0.00655 
500 82.838 
Chapter 14 


14.1 The standard deviation of the daily change in the investment in each asset is $1,000. 
The variance of the portfolio’s daily change is 


1,000? + 1,000? + 2 x 0.3 x 1,000 x 1,000 = 2,600,000 


The standard deviation of the portfolio’s daily change is the square root of this, or 


$1,612.45. The five-day 97% value at risk is therefore 1.8808 x v5 x 1,612.45 = 
$6,781. The five-day 97% ES is 


e7 1-8808°/2 
V5 x 1,612.45 x ————— = $8,178 
V21 x 0.03 


14.2 The two alternative procedures mentioned in the chapter for handling interest 
rates when the model-building approach is used to calculate VaR involve (a) the 
use of principal components analysis and (b) describing the term structure with 
10 vertices. 

14.3 The 10-year vertex has a sensitivity of $30,000 because a one-basis-point increase 
in that rate leads to an increase of 0.6 basis points in the 12-year rate. Similarly, the 
15-year vertex has a sensitivity of $20,000. The other vertices have no sensitivity. 
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14.4 AP = 3.9Ax. The standard deviation of AP is 3.9 x 0.007 = 0.027. It follows that 
the 10-day 99% VaR for the portfolio is 0.027 X 2.326 x /10 = 0.20. 

14.5 The relationship is AP = 3.9Ax + 0.5 X 4.3 X Ax? or AP = 3.9Ax + 2.15Ax°. 

14.6 The variance of the change in the value of the portfolio in one day is (in $000s) 


(10x 7)? + (8X8)? +2xX10X7X8X8X 0.8 = 16,164 


Because NT! (0.98) = 2.054, the ES is therefore 


V 16,164 x 5e7205#/2 
/2n x 0.02 


= 688.2 


or $688,200. 

14.7 Suppose that the risk measure (VaR or ES) for a portfolio is its standard deviation 
multiplied by B. The risk weight of a risk factor is its standard deviation multiplied 
by P. The risk sensitivity for a portfolio is the amount by which the portfolio 
changes for a 1% or one basis point change in the risk factor. 

14.8 The daily variance of the portfolio is 


67 x 207 + 47 x 87 = 15,424 


and the daily standard deviation is 15,424 = $124.19. Since N7!(0.9) = 1.282, 
the five-day 90% value at risk is 


124.19 xX y5 X 1.282 = $355.89 


14.9 (a) 2.0, (b) 43.9. 

14.10 The delta of the options is the rate of change of the value of the options with 
respect to the price of the asset. When the asset price increases by a small amount, 
the value of the options decreases by 30 times this amount. The gamma of the 
options is the rate of change of their delta with respect to the price of the asset. 
When the asset price increases by a small amount, the delta of the portfolio 
decreases by five times this amount. The delta and gamma with respect to pro- 
portional changes are —30 x 20 = —600 and —5 X 20 x 20 = —2,000. In this case, 
o = 0.01 so that E(AP) = —0.10, E(AP?) = 36.03, and E(A P?) = —32.415. The 
mean change in the portfolio value in one day is —0.1 and the standard deviation 


of the change in one day is y 36.03 — 0.17 = 6.002. The skewness is 


—32.415 — 3 x 36.03 x (-0.1) + 2 x (-0.1)? _ 21.608 
6.0023 ~ 216.180 


= —0.10 
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Using only the first two moments, the one-day 99% value at risk is $14.08. When 
three moments are considered in conjunction with a Cornish-Fisher expansion, 
it is $14.53. 

14.11 Define © as the volatility per year, AS as the change in the asset price in one day, 
and Ao as the change in the volatility in one day. The relationship between AP, 
AS and Ao is 


AP = —30AS — 0.5 x 5 x (AS)? — 2Ac 
or 
AP = —30 x 20Ax — 0.5 x 5 x 207(Ax)? — 2Ao 
(where Ax = AS/S), which simplifies to 
AP = —600Ax — 1,000(Ax)* — 2Ao 


The change in the portfolio value now depends on two risk factors. 

14.12 The change in the value of an option is not linearly related to the changes in the 
values of the underlying variables. When the changes in the values of underlying 
variables are normal, the change in the value of the option is non-normal. The 
linear model assumes that it is normal and is, therefore, only an approximation. 

14.13 The contract is a long position in a sterling bond combined with a short position 
in a dollar bond. The value of the sterling bond is 1.53e~°-*°, or $1.492 million. 
The value of the dollar bond is 1.5¢79-°*>, or $1.463 million. The variance of 
the change in the value of the contract in one day is 


1.4927 x 0.00062 + 1.4637 x 0.00057 — 2 x 0.8 x 1.492 x 0.0006 x 1.463 
x 0.0005 = 0.000000288 


The standard deviation is therefore $0.000537 million. The 10-day 99% VaR is 


0.000537 x 10 x 2.33 =$0.00396 million. 

14.14 The alphas should be changed to 2,500. This changes the one-day 99% VaR to 
$226,836 and the one-day ES to $259,878 when volatilities and correlations are 
estimated using the equally weighted model. It changes the one-day 99% VaR to 
$487,737 and the one-day 99% ES to $558,783 when EWMA with A = 0.94 is 
used. 

14.15 This changes the one-day 99% VaR from $471,025 to $389,290 and the one-day 
ES from $539,637 to $445,996. 

14.16 One way is to assume that the correlation between a vertex of the first term struc- 
ture and a vertex of the second term structure is the same for all vertices. Another 
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is to calculate the impact on the portfolio of changes to each term structure sep- 
arately and to use the correlation to aggregate the two impacts. 


Chapter 15 


15.1 The removal of a competitor may be beneficial. However, banks enter into many 
contracts with each other. When one bank goes bankrupt, other banks are liable 
to lose money on the contracts they have with the bank. Also, other banks will be 
adversely affected if the bankruptcy reduces the public’s overall level of confidence 
in the banking system. 

15.2 Deposit insurance means that depositors are safe regardless of the risks taken by 
their financial institution. It is liable to lead to financial institutions taking more 
risks than they otherwise would because they can do so without the risk of losing 
deposits. This in turn leads to more bank failures and more claims under the 
deposit insurance system. Regulation requiring the capital held by a bank to be 
related to the risks taken is necessary to avoid this happening. 

15.3 The credit risk on the swap is the risk that the counterparty defaults at some future 
time when the swap has a positive value to the bank. 

15.4 The value of a currency swap is liable to deviate further from zero than the value 
of an interest rate swap because of the final exchange of principal. As a result, the 
potential loss from a counterparty default is higher. 

15.5 There is some exposure. If the counterparty defaulted now, there would be no 
loss. However, interest rates could change so that at a future time the swap has 
a positive value to the financial institution. If the counterparty defaulted at that 
time, there would be a loss to the financial institution. The capital under Basel I 
would, from Table 15.2, be 0.5% of the swap’s principal. 

15.6 The risk-weighted assets for the three transactions are (a) $1.875 million, (b) $2 
million, and (c) $3 million, for a total of $6.875 million. The capital required is 
0.08 X 6.875 or $0.55 million. 

15.7 The net replacement ratio (NRR) is 2.5/4.5 = 0.556. The credit equivalent 
amount is 2.5 + (0.4 + 0.6 x 0.556) x 9.25 or $9.28 million. The risk-weighted 
assets are $4.64 million, and the capital required is $0.371 million. 

15.8 In this case, there is no value to the netting provisions. 

15.9 This converts the estimated capital requirement to estimated risk-weighted assets. 
Capital required equals 8% of risk-weighted assets. 

15.10 The trading book consists of instruments that are actively traded and marked 
to market daily. The banking book consists primarily of loans that are held to 
maturity and not marked to market daily. The effect of the change is to move 
the client’s borrowings from the banking book to the trading book. This typically 
reduces capital requirements. (However, the incremental risk charge in Basel IL.5, 
which is discussed in Chapter 16, brings capital requirements back up to where 
they were before.) 
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15.11 Under Basel I, the capital charged for lending to a corporation is the same regard- 
less of the credit rating of the corporation. This leads to a bank’s return on capital 
being relatively low for lending to highly creditworthy corporations. Under Basel 
II, the capital requirements of a loan are tied much more carefully to the credit- 
worthiness of the borrower. As a result, lending to highly creditworthy companies 
became more attractive. 

15.12 Regulatory arbitrage involves entering into a transaction or series of transac- 
tions, without affecting the risks being taken, in order to reduce regulatory capital 
requirements. 

15.13 EAD is the estimated exposure at default. LGD is the loss given default, which 
is the proportion of the exposure that will be lost if a default occurs. WCDR 
is the one-year probability of default in a bad year that occurs only one time 
in 1,000. PD is the probability of default in an average year. MA is the maturity 
adjustment, which allows for the fact that, in the case of instruments lasting longer 
than a year, there may be losses arising from a decline in the creditworthiness of 
the counterparty during the year as well as from a default during the year. 

15.14 Under the simple approach, the risk weight of the counterparty is replaced by the 
risk weight of the collateral for the part of the exposure covered by the collateral. 
Under the comprehensive approach, the exposure is adjusted for possible increases 
and the collateral is adjusted for possible decreases in value. The counterparty’s risk 
weight is applied to the excess of the adjusted exposure over the adjusted collateral. 

15.15 The standardized approach uses external ratings to determine capital requirements 
(but in a more sophisticated way than in Basel I). In the IRB approach, the Basel II 
correlation model is used with PD being determined by the bank. In the advanced 
IRB approach, the Basel II correlation model is used with PD, LGD, EAD, and M 
being determined by the bank. 

15.16 In the basic indicator approach, total capital is 15% of the average total annual 
gross income. In the standardized approach, gross income is calculated for dif- 
ferent business lines, and capital as a percentage of gross income is different for 
different business lines. In the advanced measurement approach, the bank uses 
internal models to determine the 99.9% one-year VaR. 

15.17 In this case, p = 0.1216, WCDR = 0.0914, and the capital requirement is 200 X 
0.7 X (0.0914 — 0.01) or $11.39 million. At least half of this must be Tier 1. 

15.18 The probability of five or more exceptions is 1-BINOMDIST(4,250,0.01, 
TRUE) or 10.8%. It could be argued that regulators are using a confidence level 
of about 10% (rather than the more usual 5%) in choosing to reject a VaR model. 


Chapter 16 


16.1 The three major components of Basel II.5 are: the calculation of stressed VaR, a 
new incremental risk charge, and a comprehensive risk measure for instruments 
dependent on credit correlation. 
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16.2 The six major components of Basel III are: capital definitions and requirements, 
the capital conservation buffer, the countercyclical buffer, the leverage ratio, liq- 
uidity ratios, and counterparty credit risk. 

16.3 VaR, as it is usually defined, is calculated from the most recent one to four years 
of daily data. Stressed VaR is calculated from a 250-day period in the past that 
would be particularly bad for the bank’s current portfolio. 

16.4 The incremental risk charge is calculated as the one-year 99.9% VaR for losses 
from credit instruments in the trading book. It takes account of rating changes 
and liquidity horizons. It was introduced because instruments in the trading book 
often attracted less capital than equivalent instruments in the banking book. 

16.5 The capital requirements for the AAA-rated ABS are 1.6% of principal whereas 
the capital requirements for the AAA-rated ABS CDO are 3.2% of principal. 

16.6 Tier 1 equity capital has increased from 2% to 7%, and the definition of equity 
capital has been tightened. 

16.7 (a) 40%, (b) 20%. 

16.8 In the leverage ratio, the denominator is not risk-weighted assets. It is total assets 
on the balance sheet without risk weighting plus derivatives exposures (calculated 
as in Basel I) and some off-balance-sheet items such as loan commitments. 

16.9 The liquidity coverage ratio (LCR) is the ratio of high-quality liquid assets to net 
cash outflows during a stressed period of 30 days. The net stable funding ratio 
(NSFR) is the ratio of a weighted sum of the items on the “liabilities and net 
worth” side of the balance sheet divided by a weighted sum of the items on the 
“assets” side of the balance sheet. 

16.10 The amount of stable funding would change to 81.6 and the NSFR would 
become 81.6/74.25 = 110%. 

16.11 The credit value adjustment (CVA) is a charge to income reflecting expected 
losses from counterparty defaults in derivatives transactions. The new regulations 
require the exposure of CVA to credit spreads to be a component of market risk 
capital. 

16.12 CoCo bonds are automatically converted into equity when a predefined trigger 
indicates that the bank’s capital is low. They are attractive to banks because they 
do not affect return on equity prior to conversion. They are attractive to regula- 
tors because they are a source of capital that can absorb losses in stressed market 
conditions. 


Chapter 17 


17.1 When an ISDA master agreement is used, each side to an OTC derivatives trans- 
action is taking the risk that the other side will default. When a CCP is used, the 
CCP acts as an intermediary so that each side has a transaction with the CCP. 

17.2 Standard transactions between financial institutions must be cleared through 
CCPs. For transactions between financial institutions that are cleared bilaterally, 
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both initial margin and variation margin are required. The initial margin must 
cover, with 99% confidence, market moves during a 10-day period of stressed 
market conditions. 

17.3 The amount of collateral (typically cash or Treasury securities) that has to be 
provided by financial institutions for their derivatives books will increase. 

17.4 The haircut applied to the market price of a financial instrument is the percentage 
reduction in its price that is made to determine its value for collateral purposes. 
For example, if an instrument is worth $100 and is subject to a 10% haircut, it can 
be used to satisfy a $90 collateral requirement. 

17.5 An event of default occurs when one side to a derivatives transaction that is sub- 
ject to an ISDA master agreement fails to post collateral or make a payment as 
required. An early termination follows an event of default and occurs when the 
nondefaulting party terminates all outstanding transactions with the defaulting 
party. 

17.6 The exposure of A including its exposure to the CCP is reduced to 70. The 
exposure of A excluding its exposure to the CCP remains 0. The exposure of B 
including its exposure to the CCP is reduced to 100. The exposure of B excluding 
its exposure to the CCP is reduced to 70. The exposures of C are unaffected. 
The average exposure of the three parties, including the exposures to the CCP, is 
reduced from 110 to 86.7. The average exposure, excluding the exposures to the 
CCP is reduced from 70 to 53.3. 

17.7 The company will lose money if (a) the CCP defaults or (b) one of the other 
members defaults and not enough margin and default fund contribution has been 
provided by that member to cover losses when its positions are closed out. 

17.8 Collateral requirements are normally calculated from the net mark-to-market 
value of all outstanding transactions. In determining settlement amounts, trans- 
actions with a positive value are netted against those with a negative value. 

17.9 Rehypothecation occurs when collateral posted by A with B is used by B to meet 
collateral demands from C. 

17.10 The activities of CCPs are much simpler than those of banks. 
17.11 The mid-market values of transactions to the non-defaulting party at the time of 
the early termination adjusted upward by half the bid—offer spreads. 


Chapter 18 


18.1 In Basel I, market risk capital is calculated from the current 99% ten-day VaR, 
which is assumed to be /10 times the current 99% one-day VaR. In Basel II.5, 
there are two components of the capital. One is calculated from the current 99% 
VaR (as in Basel I); the other is calculated from the stressed 99% 10-day VaR. Each 
component is assumed to be /10 times the corresponding one-day VaR. In the 
FRTB, capital is calculated from the 97.5% stressed ES with the time horizon for 
a variable being dependent on its liquidity. 


18.2 The 97.5% ES is p + oe7!°°/2/(4/2n x 0.025) = p + 2.3380. 
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18.3 Consider how historical simulation works. The change from Day 0 to Day 10 and 
the change from Day 1 to Day 11 have nine days in common. They are therefore 
not independent. Indeed we would expect the two changes to be similar in most 
situations. 

18.4 A theoretical advantage of ES over VaR is that it satisfies the subadditivity (diver- 
sification) condition in Section 12.5. A practical advantage is that is quantifies tail 
risk better. 

18.5 The trading book consists of items that are held for trading and marked to market 
daily. The banking book consists of items that are expected to be held to maturity. 
Capital requirements are calculated in quite different ways for the two books, and 
regulators do not want to give banks a way of reducing capital by allowing them 
to choose the book they put an item in. 

18.6 Credit spread risk is treated in the same way as other market risks. Jump-to-default 
risk is handled similarly to the credit risk associated with items in the banking 
book. 


Chapter 19 


19.1 The 10 investment grade ratings used by Moody’s are: Aaa, Aal, Aa2, Aa3, A1, A2, 
A3, Baal, Baa2, and Baa3. 

19.2 The 10 investment grade ratings used by S&P are: AAA, AA+, AA, AA-, A+, A, 
A-, BBB+, BBB, and BBB-. _ = 

19.3 From equation (19.2), the average hazard rate, A, satisfies 0.03573 = 1 — eX! so 
that à = —1n(0.96427) = 0.0364. The average hazard rate is 3.64% per year. 

19.4 Conditional on no default by year 2, the probability of a default in year 3 is 


(0.04492 — 0.02583) /(1 — 0.02583) = 0.01960 


Average hazard rate for the third year, X, satisfies 1 — e**! = 0.01960. It is 1.98% 
per year. 

19.5 The seller receives 300,000,000 x 0.0060 x 0.5 =$900,000 at times 0.5, 1.0, 1.5, 
2.0, 2.5, 3.0, 3.5, and 4.0 years. The seller also receives a final accrual payment of 
about $300,000 (=$300,000,000 x 0.0060 x 2/12) at the time of the default (four 
years and two months). The seller pays 300,000,000 x 0.6 = $180,000,000 at the 
time of the default. 

19.6 Sometimes there is physical settlement of a credit default swap, and sometimes 
there is cash settlement. In the event of a default when there is physical settlement, 
the buyer of protection sells bonds issued by the reference entity for their face 
value. Bonds with a total face value equal to the notional principal can be sold. 
In the event of a default when there is cash settlement, a calculation agent or 
an auction determines the value of the cheapest-to-deliver bonds issued by the 
reference entity a specified number of days after the default event. The cash payoff 
is then based on the excess of the face value of these bonds over the estimated 
value. 
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19.7 Risk-neutral default probabilities are backed out from credit spreads. Real-world 
default probabilities are calculated from historical data. Risk-neutral default prob- 
abilities should be used for valuation. Real-world default probabilities should be 
used for scenario analysis and credit VaR calculations. 

19.8 The payoff is L(1 — R) where L is the notional principal and R is the recovery 
rate. 

19.9 From equation (19.3), the average hazard rate over the three years is 0.0050/(1 — 
0.3) = 0.0071 or 0.71% per year. 

19.10 From equation (19.3), the average hazard rate over five years is 0.0080/(1 — 0.4) 
or 1.333% per year. Similarly, the average hazard rate over three years is 1.1667% 
per year. This means that the average hazard rate for years 4 and 5 is (5 X 1.333 — 
3 X 1.1667) /2 = 1.58%. 

19.11 Real-world probabilities of default should be used for calculating credit value at 
risk. Risk-neutral probabilities of default should be used for adjusting the price of 
a derivative for default. 

19.12 The recovery rate for a bond is the value of the bond shortly after the issuer 
defaults as a percentage of its face value. 

19.13 The first number in the second column of Table 19.5 is calculated as 


= In(1 — 0.00195) = 0.00028 


or 0.028% per year. The second number in the second column of Table 19.5 is 
calculated as 


-2 In(1 — 0.00525) = 0.00075 
and so on. The final number is 


3 In(1 — 0.42132) = 0.07814 


The numbers in the fourth column of Table 19.6 are calculated by multiplying 
the hazard rates (expressed in basis points) in Table 19.5 by 0.6. For Caa, this gives 
781.4 X 0.6 = 469. 

19.14 The no-default value of the bond 


Je70.03x0.5 4 99-0.03K1.0 4 4 102e700x40 — 103.66 


The market price is 96.16. An analysis similar to that in Table 19.3 shows that if Q 
is the default probability per year, the loss from defaults is 272.69Q. The implied 
probability of default is therefore given by solving 103.66 — 96.16 = 272.69Q. It 
is 2.74% per year. 
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19.15 If Q, is the default probability at times 0.5, 1.5, and 2.5 years, an analysis similar 
to that in Table 19.3 shows that the present value of the loss from defaults for 
the first bond is 178.31Q, per $100 of face value. The default-free value of the 
bond is 


AX 070-035X1 4. x 97 0035%2 4194 x 97 0.035%3 = 101.23 


and the market value is 98.35. It follows that 178.31Q, = 101.23 — 98.35 so that 
Q, = 0.0157 or 1.57%. If Q, is the probability of default at times 3.5 and 4.5, an 
analysis similar to that in Table 19.3 gives the present value of the loss from default 
for the second bond to be 180.56Q, + 108.53Q,. The default-free value of the 
second bond is 


4e70-035x1 + 4 x 7 0:035x2 ee E 104 x 7 0:035x5 = 101.97 
The market value is 96.24. It follows that 180.56Q, + 108.53Q, = 101.97 — 
96.24. Substituting for Q,, we find that Q, = 0.0260, or 2.60%. 

19.16 We can assume that the principal is paid and received at the end of the life of 
the swap without changing the swap’s value. If the spread were zero, the present 
value of the floating payments per dollar of principal would be 1. The payment 
of LIBOR plus the spread therefore has a present value of 1 + V. The payment 
of the bond cash flows has a present value per dollar of principal of B*. The initial 
payment required from the payer of the bond cash flows per dollar of principal is 
1 — B. (This may be negative; an initial amount of B — 1 is then paid by the payer 
of the floating rate.) Because the asset swap is initially worth zero, we have 

1+V=B'+1-B 
so that 
V = B -B 
19.17 The value of the debt in Merton’s model is V} — Eg or 
Vo — VoN (dy) + De N (ds) = De“ N(d) + VoN (dh) 
If the credit spread is s, this should equal De~""*)" so that 
De)" = De" N(dy) + VN (—d;) 


Substituting De"? = LV), we get 


Le“? = LN(d,) + N(—d,) 
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so that 
s = —In[N(d,) + N(-d,)/L]/T 


19.18 In this case Ey = 2,0, = 0.50, D = 5, r = 0.04, and T = 1. Solving the simulta- 
neous equations gives V} = 6.80 and op = 14.82. The probability of default is 
N(—d,) or 1.15%. 

19.19 At the end of each quarter for the first four years, the seller receives $1 million. 
The seller pays $70 million after four years and two months. The seller receives a 
final accrual payment of $666,667. 

19.20 A credit default swap insures a corporate bond issued by the reference entity 
against default. Its approximate effect is to convert the corporate bond into a risk- 
free bond. The buyer of a credit default swap has therefore chosen to exchange a 
corporate bond for a risk-free bond. This means that the buyer is long a risk-free 
bond and short a similar corporate bond. 

19.21 Payoffs from credit default swaps depend on whether a particular company 
defaults. Arguably some market participants have more information about this 
than other market participants. (See Business Snapshot 19.2.) 

19.22 Suppose that the principal is $100. The present value of the bond if it were risk- 
free would be 


25g a 2.570001 E., 4 2.570043 4 102.50700 = 95,3579 


The present value of the expected loss from defaults is therefore 95.3579 — 90 = 
5.3579. The asset swap is structured so that the $10 is paid initially. After that, 
$2.50 is paid every six months. In return, LIBOR plus a spread is received on the 
principal of $100. The present value of the fixed payments is 


10 + 2.5e7006x05 4 2,5670001 rer ae aaa 4 102.5670005 
= 105.3579 


The spread over LIBOR must therefore have a present value of 5.3579. The 
present value of $1 received every six months for five years is 8.5105. The spread 
received every six months must therefore be 5.3579/8.5105 = $0.6296. The asset 
swap spread is therefore 2 x 0.6296 = 1.2592% per annum. This problem provides 
an illustration of the result in Problem 19.16. 


Chapter 20 


20.1 The new transaction will increase the bank’s exposure to the counterparty if it 
tends to have a positive value whenever the existing transactions have a net positive 
value and a negative value whenever the existing transactions have a negative value. 
However, if the new transaction tends to offset the existing transactions, it is likely 
to have the incremental effect of reducing credit risk. 
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20.2 A company’s own equity would not be good collateral. When the company 
defaults on the transactions it has with you, its equity is likely to be worth very 
little. 

20.3 The statements in (a) and (b) are true. The statement in (c) is not. Suppose that 
Vy and vy are the exposures to X and Y. The expected value of vy + vy is the 
expected value of vy plus the expected value of vy. The same is not true of 95% 
confidence limits. 

20.4 Assume that defaults happen only at the end of the life of the forward contract. In 
a default-free world, the forward contract is the combination of a long European 
call and a short European put where the strike price of the options equals the 
delivery price and the maturity of the options equals the maturity of the forward 
contract. If the no-default value of the contract is positive at maturity, the call has 
a positive value and the put is worth zero. The impact of defaults on the forward 
contract is the same as that on the call. If the no-default value of the contract 
is negative at maturity, the call has a zero value and the put has a positive value. 
In this case, defaults have no effect. Again, the impact of defaults on the forward 
contract is the same as that on the call. It follows that the contract has a value 
equal to a long position in a call that is subject to default risk and a short position 
in a default-free put. 

20.5 The Black-Scholes—Merton price must be multiplied by e??? = 0.964. Black- 
Scholes—Merton overstates the price by about 3.6%. 

20.6 If many contracts entered into by Company X are subject to the same default 
trigger, the effect of the default trigger may be to increase risk. If the company 
is downgraded so that the default trigger is activated, those of its counterparties 
who have contracts with negative values to Company X will request collateral. As 
a result, Company X is likely to experience liquidity problems and may be forced 
into bankruptcy. 

20.7 A dealer is likely to have wrong-way risk when the counterparty is entering into a 
CDS selling credit protection to the dealer or when the counterparty is speculat- 
ing. The dealer is likely to have right-way risk when the counterparty is entering 
into a CDS buying credit protection from the dealer or when the counterparty is 
partially hedging an existing position. 

20.8 The cure period is the period that is assumed between an event of default and an 
early termination in CVA calculations. 

20.9 All outstanding transactions between Party A and Party B are considered to be a 
single transaction if Party B defaults. It is therefore the total value of all outstanding 
transactions than determines the loss, if any, to Party A. 

20.10 When a bank is experiencing financial difficulties, its probability of default 
increases. This increases the expected cost to its counterparties from a default 
by the bank. The latter is known as DVA and is an accounting entry that increases 
the value of transactions to the bank. 

20.11 The part of CVA relating to the credit spread changes of counterparties is a com- 
ponent of market risk under Basel II. 
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20.12 There will be losses in the CVA model if the value of the portfolio moves in 
the dealer’s favor during a 15-day period immediately prior to the midpoint of 
a time interval. It is assumed that either the dealer will fail to receive additional 
collateral from the counterparty or the counterparty will fail to return collateral 
it has received from the dealer. 


Chapter 21 


21.1 In Vasicek’s model and Credit Risk Plus, a credit loss is recognized when a default 
occurs. In CreditMetrics, both downgrades and defaults lead to credit losses. In 
Vasicek’s model, a Gaussian copula model of time to default is used. In Credit 
Risk Plus, a probability distribution is assumed for the default rate per year. In 
CreditMetrics, a Gaussian copula model is used to define rating transitions. 

21.2 The constant level of risk assumption assumes that after a certain period of time, f, 
an instrument, X, if it has deteriorated is replaced with an instrument, Y, that has 
the same risk as X had originally. After a further period of time of length t, the 
instrument if it has deteriorated is again replaced with an instrument and has the 
same risk as X had originally, and so on. 

21.3 The probability of an Aaa rating staying Aaa over two years is 82.77%. The prob- 
ability of it moving to Aa is 15.12%. 

21.4 The probability of an Aaa rating staying Aaa over six months is 95.35%. The 
probability of it moving to Aa is 4.40%. 

21.5 Movements in credit spreads for all companies over the next day could be assumed 
to be a random sample from their movements over the past 500 days. The disad- 
vantages of this approach are that the companies have zero chance of defaulting 
and accurate daily credit spread data may not be available for all companies. 

21.6 Using the binomial distribution, the probability of six or more defaults is 0.0005. 

21.7 In this case, we must average the cumulative binomial distributions for 0.5% and 
1.5% loss probabilities. The probability of six or more defaults is 0.0021. This 
shows that introducing some correlation increases the tail risk. 

21.8 The autocorrelation is quite high at 0.546. This suggests that the credit VaR esti- 
mate should take account of recent default experience. If the default rate was high 
last year, it is more likely to be high this year. 


Chapter 22 


22.1 Scenarios can be generated by (a) making large changes to key variables such as 
interest rates, equity prices, and so on; (b) making percentage changes in all market 
variables that are the same as those that occurred on particular extreme days in the 
past; and (c) asking a committee of senior management to generate the scenarios. 

22.2 Reverse stress testing involves using an algorithm to search for scenarios that lead 
to big losses. It is used to help identify appropriate scenarios for stress testing. 
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22.3 Financial institutions might consider that regulators will require more capital if 
scenarios leading to large losses are considered. 

22.4 Traffic light options were options that provided payoffs when the scenarios con- 
sidered by insurance company regulators occurred. The danger is that, when a 
financial institution buys traffic light options, it is protecting itself against a too 
narrow range of adverse scenarios. The financial institution might not be pro- 
tected against a scenario that is similar to, but not exactly the same as, the scenarios 
covered by the traffic light options. 

22.5 Senior managers are in the best position to develop scenarios for stress testing. 
Their involvement in the development of scenarios makes it more likely that they 
will take the results of stress testing seriously and incorporate stress testing in their 
strategic decision making. 

22.6 The advantage is that the same scenarios are considered by different banks and 
systemic risks can be evaluated. The scenarios might be worse than those consid- 
ered by the banks themselves (see Problem 22.3). The disadvantage is that it might 
lead to the banks themselves not spending as much time as they should on stress 
testing. 

22.7 An objective probability is calculated from data. A subjective probability is a 
“degree of belief” and reflects a person’s judgment. 

22.8 The total probability of the stressed scenarios is 1.5%. The probability associated 
with the historical simulation scenarios is therefore 98.5%. Each historical simula- 
tion scenario has a probability of 0.197. When the scenarios and their probabilities 
are ranked, we see that the VaR with a 99% confidence limit is $284,204. (For a 
loss of $340,000, the cumulative probability is 0.00943. For a loss of $284,204, the 
cumulative probability is 0.01141.) 

22.9 In this case, the position values are 941.34, -164.39, -1,349.94, and —78.36. The 
worst-case scenario is where the asset price is 60 and its volatility is 30%. This 
leads to a loss of $341.39. 


Chapter 23 


23.1 The definition includes all internal risks and external risks except those related to 
reputational risk and risks resulting from strategic decisions. 

23.2 Based on the results reported by Shih, the loss would be 100 x 3°”? or $128.7 
million. 

23.3 Prob(v > x) = Kx~°®. When x = 20 the probability is 0.1. This means that K = 
1.0986. The probabilities of the specified losses being exceeded are (a) 5.74%, (b) 
3.30%, and (c) 1.58%. 

23.4 Moral hazard is handled by deductibles and by making premiums dependent on 
past claims. Adverse selection is handled by finding out as much as possible about 
a driver before insurance is granted and then modifying premiums as more infor- 
mation on the driver becomes available. 
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23.5 CEOs must prepare a statement asserting that the financial statements are accurate. 
They must return bonuses in the event that there is a restatement of financial 
statements. 

23.6 If a trader operates within established risk limits and takes a loss, this is part of 
market risk. If risk limits are violated, the loss becomes classified as an operational 
risk. 

23.7 (a) It is unlikely that an individual would not look after his or her health because of 
the existence of a life insurance contract. But it has been known for the beneficiary 
of a life insurance contract to commit murder to receive the payoff from the 
contract! (b) Individuals with short life expectancies are more likely to buy life 
insurance than individuals with long life expectancies. 

23.8 External loss data are data relating to the losses of other banks. They are data 
obtained from sharing agreements with other banks or from data vendors. Vendor 
data are used to determine relative loss severity. They can be a useful indicator of 
the ratio of mean loss severity in Business Unit A to mean loss severity in Business 
Unit B or the ratio of the standard deviation of loss severity in Business Unit A 
to the standard deviation of loss severity in Business Unit B. 

23.9 The Poisson distribution is often used for loss frequency. The lognormal distribu- 
tion is often used for loss severity. 

23.10 Examples of key risk indicators are staff turnover, number of failed transactions, 
number of positions filled by temps, ratio of supervisors to staff, number of open 
positions, and percentage of staff who did not take 10 days of consecutive leave in 
the past 12 months. 

23.11 When the loss frequency is 3, the mean total loss is about 3.3 and the standard 
deviation is about 2.0. When the loss frequency is increased to 4, the mean loss is 
about 4.4 and the standard deviation of the loss is about 2.4. 

23.12 The key inputs are the business indicator and (for banks with a business indicator 
over $1 billion) the losses experienced over the previous 10 years. The business 
indicator is a measure of a bank’s size that is relevant for operational risk purposes. 

23.13 For Bank A the Loss Component (in euro millions) is 7 X 20 + 7 X 20 = 280 
while for Bank B it is 7 X 20 + 7 X 20 + 5 x 20 = 380. For Bank A the Internal 
Loss Multiplier is 


In (e- 1+ =| = 0.975 
300 
while for Bank B it is 
In (e —1+ =) = 1.094 
300 


In millions of euros, Bank A’s SMA capital is 110 + (300 — 110) x 0.975 = 295 
while Bank B’s SMA capital is 110 + (300 — 110) x 1.094 = 318. 
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Chapter 24 


24.1 Investors did not know very much about the mortgages underlying the tranches 
that were created, and the waterfalls were complex. 

24.2 The company or individual providing the quotes is prepared to buy at 50 and 
sell at 55. The mid-market quote is 52.5. The proportional bid—offer spread is 
5.0/52.5 or 0.0952. 

24.3 The bid—offer spread for the holding in Company A is 0.01 x 5,000 = $50. The 
bid—offer spread for the holding in Company B is 0.02 x 3,000 = $60. The cost 
of unwinding the portfolio is (50 + 60)/2 or $55. 

24.4 The bid—offer spread for the first holding that we are 95% confident will not be 
exceeded is 5,000 X(0.01 + 1.645 x 0.01) = $132.24. The bid—offer spread for the 
second holding that we are 95% confident will not be exceeded is 3,000 X (0.02 + 
1.645 x 0.03) = $204.04. The total cost of unwinding that we are 95% certain will 
not be exceeded is (132.24 + 204.04)/2 = $170.14. 

24.5 The amounts traded on successive days should be 15.9, 12.9, 10.0, 7.4, 5.2, 3.4, 2.2, 
1.4,0.9, and 0.7. The bid—offer spread cost is $13.4. The total price variance is 36.6 
so that the VaR for the market risk with a 95% confidence level is 1.645 X y 36.6 
= $9.9. (The objective function that is minimized is the sum of these two, or 
$23.3.) 

24.6 LCR tests whether a bank has sufficient liquidity to survive 30 days of extreme 
stress (credit downgrades, loss of deposits, drawdowns on lines of credit, etc.). 
NSFR tests how well the maturities of assets and liabilities are matched 
and discourages overreliance on short-term sources of funds for long-term 
needs. 

24.7 Wholesale deposits are more likely to disappear in stressed market conditions. 

24.8 Their hedging led to a loss on the hedge and a gain on the position being hedged. 
The loss on the hedge gave rise to margin calls. Unfortunately, the position being 
hedged, although it increased in value to the company, was illiquid. 

24.9 Positive feedback trading refers to situations where traders accentuate market 
movements. They buy when prices increase and sell when prices decrease. Nega- 
tive feedback trading is when traders do the reverse; that is, they buy when prices 
decline and sell when prices increase. Positive feedback trading 1s liable to lead to 
a liquidity problem. 

24.10 This is a VaR measure that includes an adjustment for the bid—offer spread costs 
that are incurred in a close-out of the position. 

24.11 Liquidity black holes occur when most market participants want to be on one side 
of a market. Regulation is liable to lead to liquidity black holes because, when all 
financial institutions are regulated in the same way, they tend to want to respond 
to external economic events in the same way. 

24.12 Liquidity black holes are typically caused by too many traders following the same 
trading strategy. If traders follow diverse trading strategies, liquidity black holes are 
less likely to occur. 
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Chapter 25 


25.1 Marking to market involves valuing a position using the prices at which the same 
or similar positions are trading in the market. Marking to model occurs when a 
model plays a key role in determining the price of an instrument. 

25.2 Models may have fundamental errors or they may be used inappropriately. 

25.3 The model developer may leave. Also, other employees of the financial institution, 
as well as its regulators, can understand what has been done. 

25.4 When plain vanilla call and put options are being priced, traders do use the Black— 
Scholes—Merton model as an interpolation tool. They calculate implied volatilities 
for the options that are actively traded. By interpolating between strike prices and 
between times to maturity, they estimate implied volatilities for other options. 
These implied volatilities are then substituted into the Black—Scholes—Merton 
model to calculate prices for these options. However, Black—Scholes—Merton is 
more than an interpolation tool when used for hedging. 

25.5 13.45%. We get the same answer by (a) interpolating between strike prices of 1.00 
and 1.05 and then between maturities six months and one year and (b) interpolat- 
ing between maturities of six months and one year and then between strike prices 
of 1.00 and 1.05. 

25.6 The models of physics describe the behavior of physical processes. The models of 
finance ultimately describe the behavior of human beings. 

25.7 It might notice that it is getting a large amount of business of a certain type 
because it is quoting prices different from its competitors. The pricing differences 
might also become apparent if it decides to unwind transactions and approaches 
competitors for quotes. Also, it might subscribe to a service where once a month 
it obtains the average price quotes by dealers for particular transactions. 

25.8 Within-model hedging involves hedging against changes in variables that the 
model assumes to be stochastic. Outside-model hedging involves hedging against 
parameters that the model assumes to be constant. 

25.9 P&L decomposition analyzes day-to-day profits and losses to distinguish the effects 
of (a) unhedged risks, (b) the hedging model operating imperfectly, and (c) new 
trades. 

25.10 Level 1 transactions are transactions where there are quoted prices for the same 
transaction in active markets. Level 2 transactions are transactions where there are 
quoted prices for similar transactions in active markets or for the same transaction 
in markets that are not active. Level 3 transactions are transactions where some 
modeling assumptions are required. 

25.11 Over-fitting involves trying to match too much of what is observed. Over- 
parameterization involves making a model too complex and difficult to use by 
introducing extra parameters. 

25.12 An option with a strike price of 0.90 and a time to maturity of 1.5 years has an 
implied volatility of 14.85%. An option with a strike price of 0.95 and a time to 
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maturity of 1.5 years has an implied volatility of 14.20%. An option with a strike 
price of 0.92 and a maturity of 1.5 years has an implied volatility of 0.6 x 14.85% 
+ 0.4 X 14.20%, which is 14.59%. 

25.13 The term marking to market refers to the practice of revaluing instruments (usually 
daily) so that they are consistent with the market. The prices calculated for actively 
traded products do reflect market prices. The model is used merely as an inter- 
polation tool. The term marking to market is therefore accurate for these products. 
The prices for structured products depend on the model being used, hence the 
term marking to model. 


Chapter 26 


26.1 Economic capital is a bank’s own estimate of the capital it requires. Regulatory 
capital is the capital it is required to keep by bank supervisors. 

26.2 The probability of a AA-rated company defaulting in one year. 

26.3 Business risk includes risks relating to strategic decisions and reputation. 

26.4 The models used for economic capital are likely to be broadly similar to those 
used to calculate regulatory capital in the case of market risk and operational risk. 
When calculating credit risk economic capital, a bank may consider it appropriate 
to use a different credit correlation model and different correlation parameters 
from those used in regulatory capital calculations. 

26.5 The 99.97% worst-case value of the logarithm of the loss is 0.5 +4 X 3.43 = 
14.23. The 99.97% worst-case loss is therefore $1.510 million. From the properties 
of the lognormal distribution, the expected loss is exp(0.5 + 47/2) or $4,915. The 
capital requirement is therefore $1.505 million. 

26.6 The economic capital for Business Unit 1 is 96.85. The economic capital for 
Business Unit 2 is 63.87. The total economic capital is 124.66. 

26.7 The incremental effect of Business Unit 1 on total economic capital is 60.78. 
The incremental effect of Business Unit 2 on total economic capital is 27.81. This 
suggests that 60.78 / (60.78 + 27.81) or 68.61% of economic capital should be allo- 
cated to Business Unit 1, and 27.81/(60.78 + 27.81) or 31.39% to Business Unit 
2. The marginal effect of increasing the size of Business Unit 1 by 0.5% is 0.4182. 
The marginal effect of increasing the size of Business Unit 2 by 0.5% is 0.2056. 
Euler’s theorem is satisfied because the total economic capital is approximately 
equal to the sum of 0.4182/0.005 and 0.2056/0.005. 

26.8 The capital is $38 million and the return before tax is $7 million. The before-tax 
RAROC is therefore 18.4%. In practice, the allocation of diversification benefits 
to this venture might reduce capital and increase RAROC. 

26.9 RAROC can be used to compare the past performance of different business units 
(for performance appraisal) or to project the expected future performance of busi- 
ness units (to decide how capital should be allocated). 
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Chapter 27 


27.1 In top-down approaches, risk limits are set at the senior management or board 
level for the whole organization. In bottom-up approaches, risk limits are set for 
the different business units. Both approaches are necessary in ERM because it is 
necessary to ensure that risk limits for the business units, when aggregated, are 
consistent with the risk limit for the whole organization. 

27.2 The Bowman paradox is based on empirical data showing that taking higher risks 
tends to lead to lower returns as far as major strategic investments are concerned. 

27.3 There is a danger that decisions will be made that are profitable in the short term 
but lead to long-term problems (e.g., negative publicity, loss of reputation, and 
lawsuits). Business Snapshots 5.4, 27.1, 27.2, and 27.3 are examples. 

27.4 Part of the bonus for a year can be deferred with the possibility of clawbacks. 

27.5 ERM takes a holistic (big picture) approach to risk management rather than sep- 
arately considering the management of different types of risk (credit risk, market 
risk, operational risk, etc.). 

27.6 In equation (27.1), we set R = —0.2, Rp = 0.03, Ry = 0.1174, © y = 0.1886, and 
p = 0.9. The appropriate beta is 1.49. 

27.7 Each quarter from year 2 onward, the interest rate would increase by 4%. After 16 
quarters, the rate would increase to 1.76% + 16 x 4% = 65.76%. 

27.8 Examples of cognitive biases are (a) anchoring (making too small adjustments 
to an initial estimate), (b) availability (recent information given too much 
weight), (c) representativeness (placing too much reliance on experience when 
it is limited or not representative of a current situation), and (d) inverting the 
conditionality. 


Chapter 28 


28.1 Disintermediation occurs when two sides that previously interacted through an 
intermediary start to interact directly. Reintermediation occurs when, following 
disintermediation, a new technology intermediary (providing a different service 
from the previous intermediary) enters the market. 

28.2 Machine learning is a branch of artificial intelligence where computers learn with- 
out being explicitly programmed. Distributed ledger technologies are technolo- 
gies like blockchain where a database is consensually shared and synchronized 
across multiple computers. 

28.3 Biometric authorization uses the unique biological characteristics of individuals 
to verify that they are who they say they are. Fingerprints, retinal scans, facial 
recognition, voice authentication, and heartbeat monitoring are examples of what 
can be used. 

28.4 It might think a digital currency makes it easier to control the money supply and 
manage inflation. 
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28.5 In P2P lending, borrowers and lenders transact directly with each other using an 
online platform provided by a technology company. Equity crowdfunding is a 
mechanism that allows a broad group of investors to fund start-ups via an online 
platform. 

28.6 Regulation is a barrier to entry for FinTechs. In order to conform with regulation, 
FinTechs sometimes have to offer their services through an established financial 
institution. 

28.7 It is true that if everyone invested only in index funds there would be no price 
discovery. We need some investors and fund managers to be following news reports 
and actively searching for stocks that should be bought or sold. 

28.8 RegTech refers to technology developments that address regulatory compliance 
challenges. Examples are systems for online real-time reporting, systems to screen 
customers and transactions to avoid compliance issues, and online libraries mak- 
ing it easy for financial institutions to retrieve information about regulations on 
different subjects in different countries. 

28.9 IBM recognized that it could not rely on the sale of mainframes and changed 
its culture and business model to reflect changes in the market. Kodak did not 
change its culture and business model to the same extent as the photography 
market changed. Banks will need to change both their culture and their business 
model in order to survive. 
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ABS See Asset-Backed Security. 

ABS CDO A security created from the tranches of different ABSs. 

Accrued Interest The interest earned on a bond since the last coupon payment date. 

Additional Tier 1 Capital Items such as non-cumulative preferred stock that do not 
qualify as Tier 1 equity capital. 

Add-On Factor When the credit equivalent amount for a derivatives transaction is 
being calculated, this is the percentage of principal added to the current exposure 
to allow for possible future changes in the value of the derivative. 

Adjoint Differentiation A way of modifying a computer program so that Greek letters 
are calculated in a computationally efficient way. 

Advanced Measurement Approach The way in which the most sophisticated banks 
will be allowed to calculate regulatory capital for operational risk under Basel II. 

Adverse Selection The phenomenon that, if an insurance company offers the same 
premiums to everyone, it tends to end up providing coverage for the worst risks. 

Agency Costs Costs in a business relationship where the interests of the two parties are 
not perfectly aligned. 

Alpha Return earned on a portfolio in excess of that predicted by the capital asset 
pricing model. 

Alternative Investments See Hedge Funds. 

American Option An option that can be exercised at any time during its life. 

Analytic Result Result where the answer is in the form of an equation. 

Arbitrage A trading strategy that takes advantage of two or more securities being mis- 
priced relative to each other. 

Arbitrage Pricing Theory A theory where the return from an investment is assumed 
to depend on several factors. 
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Arbitrageur An individual engaging in arbitrage. 

ASF Factor Available stable funding factor, a weighting factor used for sources of fund- 
ing in the calculation of the Net Stable Funding Ratio. 

Asian Option An option with a payoff dependent on the average price of the underlying 
asset during a specified period. 

Ask Price The price that a dealer is offering to sell an asset. 

Asked Price See Ask Price. 

Asset Swap Exchanges the promised coupon on a bond for LIBOR plus a spread. 

Asset-Backed Security A security created from the cash flows from bonds, mortgages, 
credit card receivables, or other instruments. 

Asset Swap Exchanges the promised coupon on a bond for LIBOR plus a spread. 

At-the-Money Option An option in which the strike price equals the price of the 
underlying asset. 

Autocorrelation The correlation between the value of a variable and the value of the 
same variable k days later. (k is referred to as the time lag.) 

Average Price Call Option An option giving a payoff equal to the greater of zero and 
the amount by which the average price of the asset exceeds the strike price. 

Average Price Put Option An option giving a payoff equal to the greater of zero and 
the amount by which the strike price exceeds the average price of the asset. 

Back-End Load Fee charged when an investment in a mutual fund is terminated. 

Back Office Where the record keeping takes place. 

Back-Testing Testing a value-at-risk or other model using historical data. 

Backward Induction A procedure for working from the end of a tree to its beginning 
in order to value an option. 

Banking Book Part of a bank’s portfolio that consists of instruments that are expected 
to be held to maturity. 

Bankruptcy Costs Costs such as lost sales, loss of key managers, and professional fees 
arising from a declaration of bankruptcy. These costs are not associated with the adverse 
events leading to bankruptcy. 

Barrier Option An option whose payoff depends on whether the path of the underlying 
asset has reached a barrier (i.e., a certain predetermined level). 

Basel I The first international agreement on the regulation of banks in 1988. 

Basel II New international regulations for calculating bank capital introduced in 
2007. 

Basel II.5 Extra capital charges for items in the trading book, introduced following the 
credit crisis. 

Basel III International banking regulations introduced in 2010 involving capital for the 
banking book and liquidity ratios. 

Basel Committee The committee of bank regulators from around the world that pro- 
duces standards intended to apply to banks in all countries. 

Basic Indicator Approach The simplest way of calculating regulatory capital for oper- 
ational risk under Basel II. 

Basis The difference between the spot price and the futures price of a commodity. 
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Basis Point When used to describe an interest rate, a basis point is one-hundredth of 
1% (= 0.01%). 

Basis Risk The risk to a hedger arising from uncertainty about the basis at a future time. 

Basket Credit Default Swap A credit default swap where there are several reference 
entities. 

Basket Option An option on a portfolio of assets. 

Bermudan Option An option that can be exercised on certain dates during its life. 

Best Efforts A phrase used to describe the situation where an investment bank agrees 
that it will do the best it can to sell a new issue of securities at a certain price, but does 
not guarantee that it will be able to sell them. 

Beta A measure of the systematic risk of an asset. 

Bid-Ask Spread The amount by which the ask price exceeds the bid price. 

Bid-Offer Spread See Bid-Ask Spread. 

Bid Price The price that a dealer is prepared to pay for an asset. 

Bilateral Clearing An arrangement between two parties to handle transactions in the 
OTC market, often involving an ISDA master agreement. 

Binary Credit Default Swap An instrument where there is a fixed dollar payoff in the 
event of a default by a particular company. 

Binary Option An option with a discontinuous payoff; for example, a cash-or-nothing 
option or an asset-or-nothing option. 

Binomial Model A model where the price of an asset is monitored over successive short 
periods of time. In each short period it is assumed that only two price movements are 
possible. 

Binomial Tree A tree that represents how an asset price can evolve under the binomial 
model. 

BIS Accord An agreement reached in 1988 between the central banks of 12 countries 
concerning how banks should be regulated. 

Bitcoin The first cryptocurrency. 

Bivariate Normal Distribution A joint distribution for two correlated variables, each 
of which has a normal distribution. 

Black’s Model An extension of the Black—Scholes—Merton model for valuing European 
options on futures contracts. It is used extensively in practice to value European options 
when the distribution of the asset price at maturity is assumed to be lognormal. 

Black—-Scholes—Merton Model A model for pricing European options on stocks, devel- 
oped by Fischer Black, Myron Scholes, and Robert Merton. 

Blockchain Distributed ledger technology where a secure database is consensually 
shared and synchronized across multiple computers. 

Bond Option An option where a bond is the underlying asset. 

Bond Yield The discount rate which, when applied to all the cash flows of a bond, 
causes the present value of the cash flows to equal the bond’s market price. 

Bootstrap Method A procedure for calculating the zero-coupon yield curve from mar- 
ket data. Also a statistical procedure for calculating confidence levels when distributions 
are determined empirically. 
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Bunching A tendency for days when the loss is greater than the value at risk to be 
clustered close together. 

Business Risk When used by a bank, this refers to strategic risk (related to a bank’s 
decision to enter new markets and develop new products) and reputation risk. 

Buy and Hold A trading strategy where a position is set up and there is no subsequent 
trading of the position. 

Calendar Days Includes every day. 

Calibration A method for implying a model’s parameters from the prices of actively 
traded options. 

Callable Bond A bond containing provisions that allow the issuer to buy it back at a 
predetermined price at certain times during its life. 

Call Option An option to buy an asset at a certain price by a certain date. 

Cancelable Swap A swap that can be canceled by one side on prespecified dates. 

Cap See Interest-Rate Cap. 

Capital Adequacy The adequacy of the capital held by a bank or other financial insti- 
tution. 

Capital Asset Pricing Model A model relating the expected return on an asset to its 
beta. 

Capital Conservation Buffer Extra equity capital that, under Basel III, must be kept so 
that it is available to absorb losses during downturns. 

Caplet One component of an interest rate cap. 

Cap Rate The rate determining payoffs in an interest rate cap. 

Cascade Approach Describes the calculation methodology in FRTB’s internal models 
approach. 

Case-Shiller Index An index of house prices in the United States. 

Cash-Flow Mapping A procedure for representing an instrument as a portfolio of zero- 
coupon bonds for the purpose of calculating value at risk. 

Cash Settlement A procedure for settling a contract in cash rather than by delivering 
the underlying asset. 

CAT Bond A bond where the interest and, possibly, the principal paid are reduced if a 
particular category of “catastrophic” insurance claims exceeds a certain amount. 

CCAR The Comprehensive Capital Analysis and Review is a stress test in the United 
States for banks with consolidated assets over $50 billion. 

CCP See Central Clearing Party. 

CDD See Cooling Degree Days. 

CDO See Collateralized Debt Obligation. 

CDO Squared An instrument in which the default risks in a portfolio of CDO tranches 
are allocated to new securities. 

CDS See Credit Default Swap. 

CDS-Bond Basis The excess of the CDS spread over the asset swap spread. 

CDX An index of the credit quality of 125 North American investment grade compa- 
nies. 

Central Clearing The use of a central counterparty (CCP) for OTC transactions. 
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Central Counterparty Clearing house for over-the-counter transactions. 

Central Limit Theorem A theorem that states that the sum of a large number of inde- 
pendent identically distributed variables 1s approximately normally distributed even if 
the variables themselves are not normally distributed. 

Cheapest-to-Deliver Bond A bond that is cheapest to buy and then deliver in a futures 
contract or a credit default swap contract. 

Chinese Walls A phrase used to describe the policies in place within a financial insti- 
tution that prevent information flowing from one part of the financial institution to 
another when this would be disadvantageous to one or more of the financial institu- 
tion’s clients. 

Cholesky Decomposition A method of sampling from a multivariate normal distri- 
bution. 

Clawback In the case of a hedge fund, a clawback allows a percentage of previous incen- 
tive fees to be recouped to compensate investors for current losses. In the case of a 
deferred bonus, it leads to the bonus not being paid if financial performance deterio- 
rates. 

Clean Price of Bond The quoted price of a bond. The cash price paid for the bond 
(or dirty price) is calculated by adding the accrued interest to the clean price. 

Clearing House A firm that guarantees the performance of the parties in an exchange- 
traded derivatives transaction. (Also referred to as a clearing corporation.) 

Clearing Margin A margin posted by a member of a clearing house. 

Closed-End Fund A mutual fund where there is a fixed number of shares. 

CoCo See Contingent Convertible Bond. 

Cognitive Bias [logical pattern of judgment. 

Coherent Risk Measure A risk measure satisfying a number of conditions. 

Collar See Interest-Rate Collar. 

Collateral Cash or marketable securities that must be posted by a party to a trans- 
action. 

Collateralization A system for posting collateral by one or both parties in a derivatives 
transaction. 

Collateralized Debt Obligation A way of packaging credit risk. Several classes of secu- 
rities (known as tranches) are created from a portfolio of bonds and there are rules for 
determining how the costs of defaults are allocated to classes. 

Commercial Bank A bank that takes deposits and makes loans. 

Component Value at Risk VaR corresponding to a component of a portfolio. Defined 
so that the sum of the component VaRs for the components of a portfolio equals the 
VaR for the whole portfolio. 

Compounding Frequency This defines how an interest rate is measured. 

Compounding Swap A swap where interest compounds instead of being paid. 

Compound Option An option on an option. 

Comprehensive Risk Measure A risk measure that calculates the capital charge for 
instruments dependent on credit correlation. 

Conditional Tail Expectation See Expected Shortfall. 
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Conditional Value at Risk (C-VaR) See Expected Shortfall. 

Confidence Level A probability indicating the level of confidence that an event will (or 
will not) happen. In value at risk calculations, the confidence level is the probability 
that the loss will not exceed the value at risk. 

Confirmation A contract confirming a verbal agreement between two parties to a trade 
in the over-the-counter market. 

Constant Level of Risk An assumption in the calculation of a risk measure that a 
financial institution will trade periodically so that its risk at the beginning of each 
period is the same as today. (Can be contrasted with Buy and Hold.) 

Consumption Asset An asset held for consumption rather than investment. 

Contagion See Credit Contagion. 

Contingent Convertible Bond (CoCo) Bond that is automatically converted to equity 
when there is a trigger indicating that the issuer needs more equity capital. 

Continuous Compounding A way of quoting interest rates. It is the limit as the assumed 
compounding interval is made smaller and smaller. 

Convenience Yield A measure of the benefits from ownership of an asset that are not 
obtained by the holder of a long futures contract on the asset. 

Conversion Factor A factor multiplied by principal to convert an off-balance-sheet 
item to its credit equivalent amount. 

Convertible Bond A corporate bond that can be converted by the holder into 
a predetermined amount of the company’s equity at certain times during its 
life. 

Convexity A measure of the curvature in the relationship between bond prices and 
bond yields. 

Convexity Adjustment An overworked term. For example, it can refer to the adjust- 
ment necessary to convert a futures interest rate to a forward interest rate. It can also 
refer to the adjustment to a forward rate that is sometimes necessary when instruments 
are valued. 

Cooke Ratio A ratio of capital to risk-weighted assets under Basel I. 

Cooling Degree Days The maximum of zero and the amount by which the daily 
average temperature is greater than 65 degrees Fahrenheit. The average temperature 
is the average of the highest and lowest temperatures (midnight to midnight). 

Copula A way of defining the correlation between variables with known distributions. 

Core Tier 1 Capital See Tier 1 Equity Capital. 

Cornish-Fisher Expansion An approximate relationship between the fractiles ofa prob- 
ability distribution and its moments. 

Correlation A measure of the extent to which there is a linear relation between two 
variables. 

Correlation Matrix An n X n matrix where the {i,j} element is the correlation between 
variable i and variable j. 

Cost of Carry The storage costs plus the cost of financing an asset minus the income 
earned on the asset. 

Countercyclical Buffer An extra capital charge, under Basel III, that is left to the dis- 
cretion of national regulators. 
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Counterparty The opposite side in a financial transaction. 

Coupon The interest payment made on a bond. 

Covariance A measure of the linear relationship between two variables (equals the cor- 
relation between the variables times the product of their standard deviations). 

Covariance Rate The covariance between daily returns of two variables. 

Covered Call A short position in a call option on an asset combined with a long position 
in the asset. 

Crashophobia The fear of a stock market crash similar to that in 1987 that some people 
claim causes market participants to increase the value of deep-out-of-the-money put 
options. 

Credit Contagion The process whereby credit losses in one sector of the world econ- 
omy lead to credit losses in other sectors. 

Credit Default Swap An instrument that gives the holder the right to sell a bond for 
its face value in the event of a default by the issuer. 

Credit Derivative A derivative whose payoff depends on the creditworthiness of one 
or more companies or countries. 

Credit Equivalent Amount Size of loan that is considered equivalent to an off-balance- 
sheet transaction in Basel I. 

Credit Event A default or other event that triggers a payout in a credit default 
swap. 

Credit Indices Indices that track the cost of buying protection for companies in a spec- 
ified portfolio. 

CreditMetrics A procedure for calculating credit value at risk. 

Credit Migration The movement of a company from one rating category to another. 

Credit Rating A measure of the creditworthiness of a bond issue. 

Credit Ratings Transition Matrix A table showing the probability that a company will 
move from one credit rating to another during a certain period of time. 

Credit Risk The risk that a loss will be experienced because of a default by the coun- 
terparty in a derivatives transaction. 

Credit Risk Plus A procedure for calculating credit value at risk. 

Credit Spread The cost of buying credit protection. Also, the difference between the 
yield on a bond that might default and the yield on a risk-free bond. 

Credit Support Annex Part ofan ISDA master agreement specifying collateral arrange- 
ments. 

Credit Transition Matrix See Credit Ratings Transition Matrix. 

Credit Value Adjustment The adjustment to the value of derivatives transactions made 
by a dealer to allow for the possibility that the counterparty may default. 

Credit Value at Risk The credit loss that will not be exceeded at some specified con- 
fidence level. 

Crowdfunding Funding a project or a company using an online platform. 

Cryptocurrency Digital currency designed to provide a secure medium of exchange. 

CSA See Credit Support Annex. 

Cumulative Distribution Function The probability that a variable will be less than x 
as a function of x. 
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Cure Period A period of time assumed to elapse between a default event and a termi- 
nation in OTC derivatives transactions that are settled bilaterally. 

Currency Swap A swap where interest and principal in one currency are exchanged 
for interest and principal in another currency. 

Current Exposure Method A method for calculating exposure at default on derivatives 
in Basel I and Basel II. It sets exposure at default equal to current exposure plus an 
add-on factor. 

Current VaR or ES VaR or ES calculated using most recent data, not data from a 
stressed period. 

Curvature Gamma. 

CVA See Credit Value Adjustment. 

Day Count A convention for quoting interest rates. 

Day Trade A trade that is entered into and closed out on the same day. 

Debit Value Adjustment An adjustment to the value of derivatives transactions made 
by a dealer to allow for the possibility that the dealer may default. 

Debt Value Adjustment See Debit Value Adjustment. 

Default Correlation Measures the tendency of two companies to default at about the 
same time. 

Default Intensity See Hazard Rate. 

Deferred Annuity An annuity where time elapses between the payment of the funds 
that will provide the annuity and the start of the annuity. 

Defined Benefit Plan A pension plan where there is a formula defining the pension 
that will be received. Typically, the formula depends on the number of years of service 
and the salary during the final years of service. 

Defined Contribution Plan A pension plan where an employee’s contributions plus the 
contributions made for the employee by the employer are kept in a separate account 
and invested. On retirement the funds in the account are usually converted into an 
annuity. Sometimes, they can be taken out as a lump sum. 

Deleveraging The process of individuals and companies reducing their borrowing. 

Delivery Price The price that will be paid or received in a forward contract. 

Delta The rate of change of the price of a derivative with the price of the underlying 
asset. 

Delta Hedging A hedging scheme that is designed to make the price of a portfolio of 
derivatives insensitive to small changes in the price of the underlying asset. 

Delta-Neutral Portfolio A portfolio with a delta of zero so that there is no sensitivity 
to small changes in the price of the underlying asset. 

Dependence Variable A is dependent on variable B if knowing the value of B affects 
the probability density function of A. 

Deposit Insurance Government programs for providing restitution to the depositors of 
a bank in the event that the bank fails. 

DerivaGem Software for valuing options, available on the author’s website. 

Derivative An instrument whose price depends on, or is derived from, the price of 
another asset. 
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Deterministic Variable A variable whose future value is known. 

DFAST The Dodd—Frank Act Stress Test is a stress test in the United States for banks 
with consolidated assets over $10 billion. 

Digital Currency Type of currency available only in digital form. It is not available as 
banknotes or coins. 

Directed Brokerage A phrase used to describe the situation where a fund directs its 
trades to a brokerage house in return for the brokerage house advising its clients to 
buy the fund. 

Dirty Price of Bond The cash price of a bond. 

Discount Bond See Zero-Coupon Bond. 

Discount Instrument An instrument, such as a Treasury bill, that provides no coupons. 

Discount Rate The annualized dollar return on a Treasury bill or similar instrument 
expressed as a percentage of the final face value. 

Disintermediation Change where less use is made of intermediaries. 

Distance to Default The number of standard deviations that the value of a company’s 
assets must move for a default to be triggered. 

Diversification Reducing risk by dividing a portfolio between many different assets. 

Dividend A cash payment made to the owner of a stock. 

Dividend Yield The dividend as a percentage of the stock price. 

Dodd—Frank Act An act in the United States that was introduced following the credit 
crisis and is designed to protect consumers and investors, avoid future bailouts, and 
monitor the functioning of the financial system more carefully. 

Dollar Convexity The convexity of an interest rate—-dependent portfolio multiplied by 
the value of the portfolio. 

Dollar Duration The duration of an interest rate—dependent portfolio multiplied by 
the value of the portfolio. 

Down-and-In Option An option that comes into existence when the price of the 
underlying asset declines to a prespecified level. 

Down-and-Out Option An option that ceases to exist when the price of the underlying 
asset declines to a prespecified level. 

Downgrade Trigger A clause in a contract that states that the contract can be termi- 
nated, or collateral demanded, by one side if the credit rating of the other side falls 
below a certain level. 

Duration A measure of the average life of a bond. It is also an approximation of the ratio 
of the proportional change in the bond price to the absolute change in its yield. 

Duration Matching A procedure for matching the durations of assets and liabilities. 

Dutch Auction A process where investors provide bids indicating the number of shares 
they are prepared to buy and the price they are prepared to pay. Suppose there are N 
shares on offer. The price that clears the market is the highest price, P, that is such that 
investors are prepared to buy more than N shares at either P or a higher price than P. 
The investors who have bid more than P get their orders filled at a price of P. The 
investors who have bid P get part of their orders filled at P. 

DVA See Debit Value Adjustment. 
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DV01 The impact of a one-basis-point increase in all interest rates. 

Dynamic Hedging A procedure for hedging an option position by periodically chang- 
ing the position held in the underlying asset. The objective is usually to maintain a 
delta-neutral position. 

Dynamic Scenarios Scenarios involving adverse movements in market variables that 
consider the way companies will respond to the adverse movements. 

EAD See Exposure at Default. 

Early Exercise Exercise prior to the maturity date. 

Early Termination Event The early termination of OTC derivative transactions by one 
side because of an event of default by the other side. 

Economic Capital The capital that a bank’s own calculation indicates it needs. 

Efficient Frontier The optimal trade-offs for an investor between expected return and 
standard deviation of return. 

Efficient Market Hypothesis A hypothesis that asset prices reflect relevant information. 

Electronic Trading System of trading where a computer is used to match buyers and 
sellers. 

Embedded Option An option that is an inseparable part of another instrument. 

Empirical Research Research based on historical market data. 

Endowment Life Insurance Insurance that pays out a predetermined amount 
on the maturity of the contract or on the policyholder’s death, whichever is 
earlier. 

Enterprise Risk Management A holistic approach to risk management. 

Equity Swap A swap where the return on an equity portfolio is exchanged for either a 
fixed or a floating rate of interest. 

Equity Tranche The tranche that is first to be affected by losses on the underlying 
portfolio. 

ERM See Enterprise Risk Management. 

ES See Expected Shortfall. 

Eurocurrency A currency that is outside the formal control of the issuing country’s 
monetary authorities. 

Eurodollar A dollar held in a bank outside the United States. 

Eurodollar Futures Contract A futures contract written on a Eurodollar deposit. 

Eurodollar Interest Rate The interest rate on a Eurodollar deposit. 

European Option An option that can be exercised only at the end of its life. 

Event of Default An event such as a bankruptcy or a failure to make payments as they 
become due or a failure to post collateral as it becomes due. 

EWMaA Exponentially weighted moving average. 

Exception A situation in back-testing where realized loss exceeds the value-at-risk 
estimate. 

Excess Cost Layers The costs for an insurance company on a particular type of business 
that are within a certain range. 

Excess Kurtosis A measure of the extent to which the tails of a probability distribution 
are fatter than those of a normal distribution. 
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Excess Spread A situation where the aggregate return promised to the tranches is less 
than the promised return on the underlying assets. 

Exchange-Traded Fund A fund created in such a way that units can be exchanged for 
the underlying shares and vice versa by institutional investors. 

Exchange-Traded Market A market organized by an exchange such as the New York 
Stock Exchange or Chicago Board Options Exchange. 

Ex-Dividend Date When a dividend is declared, an ex-dividend date is specified. 
Investors who own shares of the stock just before the ex-dividend date receive the 
dividend. 

Exercise Price The price at which the underlying asset may be bought or sold in an 
option contract. (Also called the strike price.) 

Exotic Option A nonstandard option. 

Expectations Theory The theory that forward interest rates equal expected future spot 
interest rates. 

Expected Shortfall Expected loss during N days conditional on being in the (100 — 
X)% tail of the distribution of losses. The variable N is the time horizon and X% is 
the confidence level. 

Expected Tail Loss See Expected Shortfall. 

Expected Value of a Variable The average value of the variable obtained by weighting 
the alternative values by their probabilities. 

Expense Ratio Ratio of expenses to value of assets under management for a fund. 

Expiration Date The end of life of a contract. 

Exponentially Weighted Moving Average Model A model to provide forecasts for a 
variable from historical data where the weights applied to data decline exponentially 
as we move back through time. It is sometimes applied to variances and covariances 
in value at risk calculations. 

Exponential Weighting A weighting scheme where the weight given to an observation 
depends on how recent it is. The weight given to an observation t time periods ago 
is A times the weight given to an observation t — 1 time periods ago where A < 1. 

Exposure at Default The maximum amount that could be lost (assuming no recovery) 
when a default occurs. 

Extreme Value Theory A theory enabling the shape of the tails of a distribution to be 
estimated from data. 

Factor Source of uncertainty. 

Factor Analysis An analysis aimed at finding a small number of factors that describe 
most of the variation in a large number of correlated variables. (Similar to a principal 
components analysis.) 

Factor Copula A copula involving several variables where a factor model describes the 
correlation structure of the transformed variables. 

Factor Loadings The values of variables in a factor model when we have one unit of a 
particular factor and no units of other factors. 

Factor Model A model where a set of correlated variables are assumed to depend lin- 
early on a number of uncorrelated factors. 
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Factor Score In a factor model, this is the amount of a factor present in a particular 
observation. 

Fair Value Accounting Involves showing financial instruments on the balance sheet at 
their market value. 

FICO A credit score developed by Fair Isaac Corporation. 

Financial Intermediary A bank or other financial institution that facilitates the flow of 
funds between different entities in the economy. 

FinTech Application of technology in finance. 

Firm Commitment Phrase used to describe the situation where an investment bank 
guarantees that a new issue of securities will be sold at a certain price. If investors do 
not want to buy the securities, the investment bank will have to do so. 

Fixed Annuity An annuity (usually for life) where payments that will be received by 
the policyholder are fixed in advance. 

Floor See Interest-Rate Floor. 

Floor-Ceiling Agreement See Collar. 

Floorlet One component of a floor. 

Floor Rate The rate in an interest rate floor agreement. 

Foreign Currency Option An option on a foreign exchange rate. 

Forward Contract A contract that obligates the holder to buy or sell an asset for a 
predetermined delivery price at a predetermined future time. 

Forward Exchange Rate The forward price of one unit of a foreign currency. 

Forward Interest Rate The interest rate for a future period of time implied by the rates 
prevailing in the market today. 

Forward Price The delivery price in a forward contract that causes the contract to be 
worth zero. 

Forward Rate Can refer to a forward interest rate or a forward exchange rate. 

Forward Rate Agreement (FRA) An agreement that a certain interest rate will apply 
to a certain principal amount for a certain time period in the future. 

Front-End Load A fee charged when a mutual fund is purchased. 

Front Office Where the trading takes place. 

Front Running Trading a stock on a personal account before a fund trades a large 
volume of the stock. 

FRTB See Fundamental Review of the Trading Book. 

Fundamental Review of the Trading Book A new way of assigning capital for market 
risk proposed by the Basel Committee. 

Fund of Funds A fund that invests in a portfolio of different hedge funds. 

Futures Contract A contract that obligates the holder to buy or sell an asset at a prede- 
termined delivery price during a specified future time period. The contract is settled 
daily. 

Futures Option An option on a futures contract. 

Futures Price The delivery price currently applicable to a futures contract. 

G-30 Policy Recommendations A set of recommendations concerning derivatives 
issued by nonregulators in 1993. 
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Gamma The rate of change of delta with respect to the asset price. 

Gamma-Neutral Portfolio A portfolio with a gamma of zero. 

Gap Management A procedure for matching the maturities of assets and liabilities. 

GARCH Model A model for forecasting volatility where the variance rate follows a 
mean-reverting process. 

Gaussian Copula Model A copula model based on the multivariate normal 
distribution. 

Girsanov’s Theorem A theorem showing that when we move from a world with one 
set of risk preferences to a world with another set of risk preferences the drifts of 
stochastic variables may change but their volatilities remain the same. 

Glass-Steagall Act An act passed in the United States separating commercial and invest- 
ment banks. 

Greek Letters See Greeks. 

Greeks Hedge parameters such as delta, gamma, vega, theta, and rho. 

Gross Income Net interest income plus non-interest income. 

Group Life Insurance Life insurance arranged for a group of people, usually by an 
employer. 

G-SIB Global systemically important bank. 

Haircut The discount applied to the value of an asset when it is used as collateral. 

Hazard Rate Measures probability of default in a short period of time conditional on 
no earlier default. 

HDD See Heating Degree Days. 

Heating Degree Days The maximum of zero and the amount by which the daily 
average temperature is less than 65 degrees Fahrenheit. The average temperature is the 
average of the highest and lowest temperatures (midnight to midnight). 

Hedge A trade designed to reduce risk. 

Hedge Funds Funds that are subject to less restrictions and less regulation than mutual 
funds, but are limited in the extent to which they can sell units in the fund to the 
general public. 

Hedger An individual who enters into hedging trades. 

Hedge Ratio The ratio of the size of a position in a hedging instrument to the size of 
the position being hedged. 

High-Frequency Trading (HFT) Trading generated by the use of computer algorithms. 

High-Water-Mark Clause Clause stating that gains equal to previous losses must be 
made before incentive fees apply. 

Historical Default Probability Default probability estimated from historical default 
experience. 

Historical Simulation A simulation based on historical data. 

Historical Volatility A volatility estimated from historical data. 

Holiday Calendar Calendar defining which days are holidays for the purposes of deter- 
mining payment dates in a financial transaction. 

Hurdle Rate The return that must be earned by a hedge fund before the incentive fee 
applies. 
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Hybrid Approach An approach to aggregating different types of economic capital. 

IFRS 9 A new accounting standard requiring banks to value loans by taking expected 
default losses into account. 

Implied Default Probability See Risk-Neutral Default Probability. 

Implied Volatility Volatility implied from an option price using the Black—Scholes— 
Merton or a similar model. 

Inception Profit Profit created by selling a derivative for more than its theoretical value. 

Incremental Risk Charge An extra capital charge for credit-sensitive instruments in the 
trading book. The charge reflects the fact that capital requirements for credit instru- 
ments in the banking book are often higher than for equivalent-risk instruments in 
the trading book. 

Incremental Value at Risk The difference between the value at risk with and without 
a particular component of the portfolio. 

Independence Variable A is independent of variable B if knowing the value of variable 
B does not affect the probability density function for variable A. 

Independent Amount Collateral required by a dealer in bilateral OTC trading that is 
independent of the value of outstanding transactions. 

Index Fund A fund that is designed to match the performance of a particular stock 
index. 

Initial Margin The cash required from a futures trader at the time of the trade. 

Initial Public Offering An offering to investors of shares in a company for the first 
time. Prior to an IPO, shares are typically held only by the company’s founders, its 
employees, and the providers of venture capital. 

Instantaneous Forward Rate A forward rate for a very short period of time in the 
future. 

Interest-Rate Cap An option that provides a payoff when a specified interest rate is 
above a certain level. The interest rate is a floating rate that is reset periodically. 

Interest-Rate Collar A combination of an interest-rate cap and an interest-rate floor. 

Interest-Rate Derivative A derivative whose payoffs are dependent on future interest 
rates. 

Interest-Rate Floor An option that provides a payoff when an interest rate is below a 
certain level. The interest rate is a floating rate that is reset periodically. 

Interest-Rate Option An option where the payoff is dependent on the level of interest 
rates. 

Interest-Rate Swap An exchange of a fixed rate of interest on a certain notional prin- 
cipal for a floating rate of interest on the same notional principal. 

Internal Credit Rating A credit rating produced by a financial institution rather than 
by an outside agency such as Moody’s or Standard & Poor’. 

In-the-Money Option Either (a) a call option where the asset price is greater than the 
strike price or (b) a put option where the asset price is less than the strike price. 

Intrinsic Value For a call option, this is the greater of the excess of the asset price over 
the strike price and zero. For a put option, it is the greater of the excess of the strike 
price over the asset price and zero. 
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Investment Asset An asset held by a significant number of individuals for investment 
purposes. 

Investment Bank A bank that helps companies issue debt and equity securities. 

Investment Grade A bond or other instrument with a credit rating of BBB (Baa) or 
above. 

IPO See Initial Public Offering. 

IRB Approach Internal Ratings Based approach for assessing credit risk capital in Basel 
II. 

IRC See Incremental Risk Charge. 

ISDA International Swaps and Derivatives Association. 

ISDA Master Agreement Agreement between two parties governing their OTC 
derivatives trades. 

Isserlis’ Theorem A theorem for calculating the mean of the product of zero-mean 
normally distributed variables. 

ITraxx An index of the credit quality of 125 European investment-grade companies. 

Jump-to-Default Risk The risk that a company will default. It can be contrasted with 
credit spread risk, which is the risk that the company’s credit spread will increase 
without the company defaulting. 

Junk Bond A non-investment-grade bond. 

Key Risk Indicators Indicators to track the level of operational risk. 

Kurtosis A measure of the heaviness of the tails of a distribution. 

Late Trading Trading a mutual fund after 4 P.M. at the 4 P.M. price. 

Leveraging Individuals and companies increasing their borrowings. 

LGD See Loss Given Default. 

Liar Loan A loan where the borrower did not tell the truth on the application form. 

LIBOR See London Interbank Offered Rate. 

LIBOR-in-Arrears Swap A swap where the interest paid on a date is determined by the 
interest rate observed on that date (not by the interest rate observed on the previous 
payment date). 

LIBOR-OIS Spread The difference between the LIBOR rate of interest and the OIS 
rate of interest for a particular maturity. 

LIBOR/Swap Zero Curve Zero rates that are calculated from LIBOR rates, eurodollar 
futures, and swap rates. 

LIBOR Zero Curve See LIBOR/Swap Zero Curve. 

Life Insurance Insurance where payouts depend on when a person dies. 

Linear Model A model where the change in the portfolio value depends linearly on 
the returns of the underlying market variables. 

Linear Product A derivative product whose price depends linearly on one or more 
underlying variables. 

Liquidity-Adjusted VaR A value at risk calculation that takes account of the impact of 
the bid—offer spread when positions are unwound. 

Liquidity Black Hole A situation where liquidity dries up because everyone wants to 
be on the same side of the market. 
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Liquidity Coverage Ratio The ratio of high-quality liquid assets to cash outflows dur- 
ing a stressed period of 30 days. 

Liquidity Funding Risk The risk that sources of funding will dry up. 

Liquidity Horizon The time horizon used for a risk factor in FRTB’s expected shortfall 
calculations. 

Liquidity Preference Theory A theory leading to the conclusion that forward interest 
rates are above expected future spot interest rates. 

Liquidity Premium The amount that forward interest rates exceed expected future spot 
interest rates. 

Liquidity Trading Risk The risk that it will not be possible to unwind positions in 
financial instruments at their theoretical price. 

Living Will A plan for winding up a financial institution in an orderly way so that some 
parts possibly survive. 

Lognormal Distribution A variable has a lognormal distribution when the logarithm 
of the variable has a normal distribution. 

London Interbank Offered Rate The rate at which an AA-rated bank can borrow 
from other banks. 

Longevity Bond A bond where a population of individuals is defined and the coupon 
paid at a particular time is proportional to the number of people in the population 
who are alive at that time. 

Longevity Risk The risk that people will live longer than expected. 

Long Position A position involving the purchase of an asset. 

Long-Tail Risk The risk in property—casualty insurance that there will be claims made 
well after the end of the life of a policy. 

Lookback Option An option whose payoff is dependent on the maximum or minimum 
of the asset price achieved during a certain period. 

Loss Given Default The percentage of the exposure to a counterparty that is lost when 
a default by the counterparty occurs. 

Macaulay’s Duration The weighted average of the times when cash flows are received 
where the weight applied to a cash flow is proportional to the present value of the 
cash flow. 

Machine Learning A branch of artificial intelligence that allows computers to learn 
without being explicitly programmed. 

Maintenance Margin When the balance in a trader’s margin account falls below the 
maintenance margin level, the trader receives a margin call requiring the margin 
account to be topped up. 

Margin The amount of collateral required in cash or marketable securities from a trader. 

Marginal Distribution The unconditional distribution of a variable. 

Marginal Value at Risk The rate of change of the value at risk with the size of one 
component of the portfolio. 

Margin Call A request for extra margin when the balance in the margin account falls 
below the maintenance margin level. 
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Margin Period of Risk See Cure Period. 

Marginal Distribution The unconditional distribution of a variable. 

Market Maker A trader who is willing to quote both bid and offer prices for an asset. 

Market Model A model most commonly used by traders. 

Market Portfolio A portfolio consisting of the universe of all possible investments. 

Market Risk A risk relating to movements in market variables. 

Market Timing Frequent trading of a mutual fund to benefit from stale prices or to do 
late trading. 

Marking to Market The practice of revaluing a financial instrument to reflect the cur- 
rent values of the relevant market variables. 

Marking to Model Use of a model to determine the current value of a financial instru- 
ment. 

Maturity Date The end of the life of a contract. 

Maximum Likelihood Method A method for choosing the values of parameters by 
maximizing the probability of a set of observations occurring. 

Mean Reversion The tendency of a market variable (such as a volatility or an interest 
rate) to revert back to some long-run average level. 

Merton’s Model A model that uses equity prices to estimate default probabilities. (Other 
models developed by Merton are also sometimes referred to as Merton’s model.) 

Mezzanine Tranche A tranche that experiences losses after the equity tranche but 
before the senior tranche. 

Middle Office Where risk management takes place. 

Minimum Transfer Amount The minimum amount transferred in a collateralization 
agreement. 

Model Building Approach The use of a model to estimate value at risk. 

Model Risk The risk relating to the use of models to price derivative products. 

Modified Duration A modification to the standard duration measure so that it more 
accurately describes the relationship between proportional changes in a bond price 
and actual changes in its yield. The modification takes account of the compounding 
frequency with which the yield is quoted. 

Money Center Banks Banks with a global presence that fund themselves to a significant 
extent in wholesale markets. 

Monte Carlo Simulation A procedure for randomly sampling changes in market vari- 
ables. 

Moral Hazard The possibility that the behavior of an insured entity will change because 
of the existence of an insurance contract. 

Mortality Risk The risk that people will die earlier than expected. 

Multibank Holding Company A holding company that owns several banks. This struc- 
ture was used in the United States for getting around some bank regulations. 

Multivariate Normal Distribution The joint distribution of many variables, each of 
which is normal. 

Mutual Fund A vehicle for investing the funds of many small investors. 
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Naked Position A short position in a call option that is not combined with a long 
position in the underlying asset. 

National Association of Insurance Commissioners A national body that provides 
services to the insurance regulators in each state in the United States. 

Negative Feedback Trading Trading where assets are sold after a price increase and 
bought after a price decrease. 

Net Asset Value The value of a fund’s investments divided by the number of shares in 
the fund. 

Net Interest Income The excess of interest earned over interest paid for a bank. 

Net Replacement Ratio The ratio of current exposure with netting to current expo- 
sure without netting. 

Net Stable Funding Ratio The weighted average of funding available divided by 
weighted average of funding required. 

Netting The ability to offset contracts with positive and negative values in the event of 
a default by a counterparty. 

Neural Networks A machine learning tool that allows computers to recognize patterns 
in a similar way to humans. 

NINJA A term used to describe a poor credit risk: no income, no job, no assets. 

Node Delta When the term structure is piecewise linear, this measures the impact on 
a portfolio of changing the interest rate corresponding to one of the vertices by one 
basis point while keeping the other vertices unchanged. 

Non-Investment Grade A bond or other instrument with a credit rating below BBB 
(Baa). 

Nonlife Insurance See Property—Casualty Insurance. 

Nonlinear product A derivative product that is not linearly dependent on the under- 
lying variables. 

Nonperforming Loan A loan where interest is more than 90 days overdue. 

Nonsystematic Risk Risk that can be diversified away. 

Normal Distribution The standard bell-shaped distribution of statistics. 

Normal Market A market where futures prices increase with maturity. 

Notional Principal The principal used to calculate payments in a derivatives transaction. 
The principal is notional because it is neither paid nor received. 

Numerical Procedure A method of calculation when no formula is available. 

Objective Probability Probability based on data. 

Offer Price See Ask Price. 

OIS See Overnight Indexed Swap. 

OIS Rate The fixed rate exchanged for geometric average of overnight interest rates in 
an OIS swap. 

Open-End Fund A mutual fund where the number of shares outstanding increases and 
decreases as investors deposit and withdraw funds. 

Open Interest The total number of long positions outstanding in a futures contract 
(equals the total number of short positions). 

Open Outcry A system of trading where traders meet on the floor of the 
exchange. 
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Operational Risk The risk of loss arising from inadequate or failed internal processes, 
people, and systems or from external events. 

Option The right to buy or sell an asset. 

Organized Trading Facility Electronic platform for standard over-the-counter deriva- 
tives in Europe. 

Originate-to-Distribute Model Phrase used to describe the practice where a bank 
originates loans, credit card receivables, and so on, and securitizes them so that the 
credit risks are distributed to other investors. 

OTC Market See Over-the-Counter Market. 

OTF See Organized Trading Facility. 

Out-of-the-Money Option Either (a) a call option where the asset price is less than the 
strike price or (b) a put option where the asset price is greater than the strike price. 

Outside-Model Hedging Hedging against parameters that are assumed to be constant 
in the model (e.g., hedging against volatility in Black—Scholes—Merton). 

Overcollateralization A situation where total principal of the underlying assets is 
greater than the sum of the principals of the tranches that are created from the assets. 

Overnight Indexed Swap An agreement to exchange the geometric average of 
overnight rates during a certain period for a fixed rate. 

Over-the-Counter Market A market where traders deal directly with each other rather 
than through an exchange. The traders are usually financial institutions, corporations, 
and fund managers. 

P2P Lending Peer-to-peer lending is lending money to a borrower using an online 
platform without a bank or similar intermediary being involved. 

P&L Attribution The analysis of the reason for daily profit and loss. 

Parallel Shift A movement in the yield curve where each point on the curve changes 
by the same amount. 

Partial Duration The percentage change in value of a portfolio for a small change in 
one point on the zero-coupon yield curve. 

Partial Expected Shortfall The expected shortfall for part of a portfolio. 

Partial Simulation Approach An approach for calculating value at risk where Greek 
letters and Taylor series expansions are used to approximate the change in the portfolio 
value. 

Par Value The principal amount of a bond. 

Par Yield The coupon on a bond that makes its price equal the principal. 

Payoff The payment received by the holder of an option or other instrument. 

PD Probability of default. 

Peak Exposure A high percentile (e.g., 97.5) of the estimated exposure distribution at 
a particular future time. 

Performing Loan A loan where interest is not more than 90 days overdue. 

Permanent Life Insurance See Whole Life Insurance. 

Physical Default Probability See Historical Default Probability. 

Plain Vanilla A term used to describe a standard deal. 

Poison Pill Action taken by a company to make a takeover of the company more 


difficult. 
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Poisson Distribution A distribution for number of events in a certain time period in a 
Poisson process. 

Poisson Process A process describing a situation where events happen at random. 
The probability of an event in time At is AAt where A is the intensity of the 
process. 

Policyholder The holder of an insurance policy. 

Portfolio Immunization Making a portfolio relatively insensitive to interest rates. 

Portfolio Insurance Entering into trades to ensure that the value of a portfolio will not 
fall below a certain level. 

Positive Feedback Trading Trading where assets are sold after a price decline and 
bought after a price increase. 

Positive Semidefinite A condition that must be satisfied by a variance—covariance 
matrix for it to be valid. 

Power Law Law describing the tails of many probability distributions that are encoun- 
tered in practice. 

Predatory Trading Trading by someone who thinks another market participant will do 
a similar trade in the near future and that the similar trade, if it happens, will move the 
market. May or may not be legal, depending on the circumstances. 

Premium The price of an option. 

Prime Broker A bank that clears a hedge fund’s trades, lends the hedge fund money, and 
provides other services. 

Principal The par or face value of a debt instrument. 

Principal Components Analysis An analysis aimed at finding a small number of factors 
that describe most of the variation in a large number of correlated variables. (Similar 
to factor analysis.) 

Private Placement Selling a new issue of securities to a small number of large financial 
institutions without offering it to the general public. 

Property—Casualty Insurance Property insurance provides a payout in the event of loss 
of property or damage to property. Casualty insurance provides protection against legal 
liabilities or damage to the property of others. 

Proprietary Trading Trading for a financial institution’s own account rather than the 
account of one of the financial institution’s clients. 

Public Offering Attempting to sell a new issue of securities to the general public. 

Put-Call Parity The relationship between the price of a European call option and the 
price of a European put option when they have the same strike price and maturity 
date. 

Put Option An option to sell an asset for a certain price by a certain date. 

Puttable Bond A bond where the holder has the right to sell it back to the issuer at 
certain predetermined times for a predetermined price. 

Puttable Swap A swap where one side has the right to terminate early. 

QIS See Quantitative Impact Study. 

Quadratic Model A quadratic relationship between change in portfolio value and per- 
centage changes in market variables. 
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Quantitative Impact Study A study by the Basel Committee of the effect of proposed 
new regulations. 

RAROC Risk-adjusted return on capital. 

RCSA Risk control and self-assessment. An approach to assessing operational risk. 

Real World The world we live in. Can be contrasted with the risk-neutral world, which 
is an artificial world used for valuing derivatives. 

Rebalancing The process of adjusting a trading position periodically. Usually the pur- 
pose is to maintain delta neutrality. 

Recovery Rate The amount recovered in the event of a default as a percent of the face 
value. 

Reference Entity A company or country whose default is being insured against in a 
credit default swap. 

RegTech The use of technology by a financial institution to improve compliance with 
regulations. 

Regulatory Arbitrage Transactions designed to reduce the total regulatory capital of 
the financial institutions involved. 

Regulatory Capital Capital a financial institution is required by regulators to keep. 

Rehypothecation The use of collateral posted by one counterparty to satisfy the col- 
lateral requirements of another counterparty. 

Reinsurance A situation where an insurance company passes on some of its risks to 
another company. Of course, it has to pay the other company for taking on the 
risks. 

Reintermediation The development of new intermediaries following disintermedia- 
tion. 

Repo Repurchase agreement. A procedure for borrowing money by selling securities 
to a counterparty and agreeing to buy them back later at a slightly higher price. 

Repo Rate The rate of interest in a repo transaction. 

Reserve Requirement Percentage of deposits that must be kept in cash or on deposit 
with the central bank. 

Reset Date The date in a swap or cap or floor when the floating rate for the next period 
is set. 

Retail Banking Taking relatively small deposits from retail customers and making rela- 
tively small loans to them. 

Reverse Stress Testing Using an algorithm to search for scenarios that lead to large 
losses. 

Reversion Level The level to which the value of a market variable (e.g., a volatility or 
interest rate) tends to revert. 

Reversion Rate The rate at which a market variable is pulled to the reversion level. 

Rho The rate of change of the price of a derivative with the interest rate. 

Right-Way Risk A situation where a counterparty becomes less likely to default as the 
dealer’s exposure to the counterparty increases. 

Risk Appetite A statement by a financial institution describing the level of risk that can 
be tolerated. 
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Risk Culture How risks are taken into account within an organization. An important 
aspect of a company’s risk culture is the extent to which its employees focus on short- 
term profits without considering longer-term risks. 

Risk Factor A term used to describe a market variable that changes through time, such 
as an exchange rate, commodity price, or interest rate. 

Risk Weights In the weighted sensitivity approach, risk weights are a multiple of the 
daily standard deviation or daily volatility of a risk factor. 

Risk-Free Rate The rate of interest that can be earned without assuming any risks. 

Risk-Neutral Default Probability The default probability implied from a credit spread. 

Risk-Neutral Valuation The valuation of an option or other derivative assuming the 
world is risk neutral. Risk-neutral valuation gives the correct price for a derivative in 
all worlds, not just in a risk-neutral world. 

Risk-Neutral World A world where investors are assumed to require no extra return 
on average for bearing risks. 

Risk-Weighted Amount See Risk-Weighted Assets. 

Risk-Weighted Assets Quantity calculated in Basel I and Basel II. Total capital must 
be at least 8% of risk-weighted assets. 

Robo-Advisor Online provider of wealth management services. 

Robotics Process Automation A software application where a computer is trained to 
interact with existing computer systems. 

Roll Back See Backward Induction. 

RSF Factor Required stable funding factor, a weighting factor applied to funding 
requirements in the calculation of the Net Stable Funding Ratio. 

SA-CCR A method for calculating exposure at default when standardized approaches 
are used. It is more elaborate than the current exposure method. 

Sarbanes—Oxley An act passed in the United States in 2002 increasing the responsibil- 
ities of directors, CEOs, and CFOs of public companies. 

Scenario Analysis An analysis of the effects of possible future scenarios on the value of 
a portfolio. Also used to estimate possible operational risk losses. 

Scorecard Approach An assessment procedure used for operational risk. 

SEC Securities and Exchange Commission. 

Securitization A process whereby securities are created from future cash flow streams. 

SEF See Swap Execution Facility. 

Senior Tranche A tranche that is the last to experience losses because of defaults on the 
underlying assets. 

Shadow Banking Bank services provided by non-banks. 

Short Position A position assumed when traders sell shares they do not own. 

Short Selling Selling in the market shares that have been borrowed from another 
investor. 

SIFI Systemically important financial institution. 

SIMM See Standard Initial Margin Model. 

Simulation See Monte Carlo Simulation or Historical Simulation. 

Skewness The measure of the lack of symmetry in a probability distribution. 


Glossary 767 


Solvency I The current regulatory framework for insurance companies in the European 
Union. 

Solvency II A new regulatory framework for insurance companies proposed by the 
European Union. 

Solvency Risk The risk that liabilities will be greater than assets. 

Sovereign Wealth Funds Funds set up to make investments for a country. 

Specific Risk Charge Capital requirement for idiosyncratic risks in the trading book. 

Spectral Risk Measure Risk measure that assigns weights to the quantiles of the loss 
distribution. 

Speculative Grade See Non-Investment Grade. 

Speculator An individual who is taking a position in the market. Usually the individual 
is betting that the price of an asset will go up or that the price of an asset will go down. 

Spot Interest Rate See Zero-Coupon Interest Rate. 

Spot Price The price for immediate delivery. 

Spot Volatilities The volatilities used to price a cap when a different volatility is used 
for each caplet. 

SR 11-7 Guidance on model risk management provided by the Federal Reserve Board. 

Standard Initial Margin Model A model developed by ISDA for calculating initial 
margin required by regulators on a portfolio of bilaterally cleared derivatives between 
two financial institutions. 

Standard Normal Distribution A normal distribution with mean equal to zero and 
standard deviation equal to one. 

Standardized Measurement Approach A new method for calculating operational risk 
capital proposed by the Basel Committee. 

Static Hedge A hedge that does not have to be changed once it is initiated. 

Static Options Replication A procedure for hedging a portfolio that involves finding 
another portfolio of approximately equal value on some boundary. 

Stochastic Variable A variable whose future value is uncertain. 

Stock Index An index monitoring the value of a portfolio of stocks. 

Stock Index Futures Futures on a stock index. 

Stock Index Option An option on a stock index. 

Stock Option An option on a stock. 

Stop-Loss Rule A trading rule where a position is unwound when losses on the position 
reach a certain level. 

Storage Costs The costs of storing a commodity. 

Stressed ES Expected shortfall calculated from a 250-day period of stressed market con- 
ditions that would be particularly bad for the financial institution. 

Stressed VaR Value at risk calculated from a 250-day period of stressed market condi- 
tions that would be particularly bad for the financial institution. 

Stress Testing Testing the impact of extreme but plausible market moves on the value 
of a portfolio. 

Strike Price The price at which the asset may be bought or sold in an option contract. 
(Also called the exercise price.) 
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Structured Product A derivative designed by a financial institution to meet the needs 
of a client. 

Student’s t-Copula Copula based on the multivariate Student’s t-distribution. 

Student’s f-Distribution Distribution with heavier tails than the normal distribution. 

Subjective Probability A probability that reflects a person’s opinion, rather than data. 

Subprime Mortgage A mortgage that is more risky than the average mortgage. 

Supplementary Capital See Tier 2 Capital. 

Survivor Bond See Longevity Bond. 

Swap An agreement to exchange cash flows in the future according to a prearranged 
formula. 

Swap Execution Facility An electronic platform for standard over-the-counter deriva- 
tives in the United States. 

Swap Rate The fixed rate in an interest rate swap that causes the swap to have a value 
of zero. 

Swaption An option to enter into an interest rate swap where a specified fixed rate is 
exchanged for floating. 

Swap Zero Curve See LIBOR/Swap Zero Curve. 

Synthetic CDO A CDO where tranche payments depend on defaults in a bond port- 
folio, but the bond portfolio is not created. 

Synthetic Option An option created by trading the underlying asset. 

Systematic Risk Risk that cannot be diversified away. 

Systemic Risk The risk that a default by one financial institution will lead to defaults 
by other financial institutions, creating risks for the whole financial system. 

Tail Correlation A correlation between the tails of two distributions. Measures the 
extent to which extreme values tend to occur together. 

Tail Dependence The measure of the frequency with which extreme outcomes occur 
for two variables at the same time. 

Tail Loss See Expected Shortfall. 

Taylor Series Expansion For a function of several variables, this relates changes in the 
value of the function to changes in the values of the variables when the changes are 
small. 

Teaser Rate A low interest rate offered for the first two or three years on a mortgage. 

Temporary Life Insurance See Term Life Insurance. 

Terminal Value The value at maturity. 

Term Life Insurance Life insurance that pays off if death occurs before a certain date. 

Term Structure of Interest Rates The relationship between interest rates and their 
maturities. 

Theta The rate of change of the price of an option or other derivative with the passage 
of time. 

Threshold The value of outstanding transactions that leads to collateral being required 
in bilateral OTC trading. 

Tier 1 Equity Capital Equity capital, redefined under Basel III (also called Tier 1 Core 
Capital). 
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Tier 2 Capital Subordinated debt (life greater than five years) and similar sources of 
capital. 

Tier 3 Capital Short-term subordinated debt (life between two and five years). 

Time Decay See Theta. 

Time Value The value of an option arising from the time left to maturity (equals an 
option’s price minus its intrinsic value). 

Total Expense Ratio The ratio of total expenses to premiums for a property—casualty 
insurance company. 

Total Return Swap A swap where the return on an asset such as a bond is exchanged 
for LIBOR plus a spread. The return on the asset includes income such as coupons 
and the change in value of the asset. 

Tracking Error A measure of the error in a trading strategy designed to track a stock 
index. 

Trading Book The part of a bank’s portfolio that consists of instruments that are held 
for resale. 

Trading Days Days when markets are open for trading. 

Tranche One of several securities created from a portfolio. Examples are the tranches of 
an ABS. 

Transaction Costs The cost of carrying out a trade (commissions plus the difference 
between the price obtained and the midpoint of the bid-offer spread). 

Treasury Bill A short-term non-coupon-bearing instrument issued by the government 
to finance its debt. 

Treasury Bond A long-term coupon-bearing instrument issued by the government to 
finance its debt. 

Treasury Note A Treasury bond lasting less than 10 years. 

Tree A representation of the evolution of the value of a market variable for the purposes 
of valuing an option or other derivative. 

Uncleared Trades Trades that are cleared bilaterally rather than through a central coun- 
terparty. 

Unconditional Default Probability The default probability for a future time period 
as seen today. Can be contrasted with a conditional default probability, where the 
probability is conditional on no earlier default. 

Underlying Variable A variable that the price of an option or other derivative depends 
on. 

Unit Trust See Mutual Fund. 

Universal Life Insurance A form of whole life insurance where the premium payments 
can be varied from year to year. The final payout is determined from the premiums, 
the performance of the investment, and costs. 

Unsystematic Risk See Nonsystematic Risk. 

Up-and-In Option An option that comes into existence when the price of the under- 
lying asset increases to a prespecified level. 

Up-and-Out Option An option that ceases to exist when the price of the underlying 
asset increases to a prespecified level. 
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Value at Risk A loss that will not be exceeded at some specified confidence level. 

VaR See Value at Risk. 

Variable Annuity An annuity (usually for life) where the amount received depends on 
the performance of a portfolio. 

Variable Life Insurance A form of whole life insurance where the policyholder decides 
how funds will be invested. If the investments do well, the proceeds can be used to 
reduce premiums. There is a guaranteed minimum payout on death, but the payout 
may be higher if the investments do well. 

Variance-Covariance Matrix A matrix showing variances of, and covariances between, 
a number of different market variables. 

Variance Rate The square of volatility. 

Variation Margin The extra margin required to bring balance in the margin account 
up to the required level. 

Vasicek’s Model A model of default correlation based on the Gaussian copula. (Other 
models developed by Vasicek are also sometimes referred to as Vasicek’s model.) 

Vega The rate of change in the price of an option or other derivative with volatility. 

Vega-Neutral Portfolio A portfolio with a vega of zero. 

VIX Index An index of the implied volatilities of options on the S&P 500. 

Volatility A measure of the uncertainty of the return realized on an asset. 

Volatility Scaling A procedure for scaling a historical daily change up or down to reflect 
differences between the current volatility and the volatility at the time of the historical 
daily change. 

Volatility Skew A term used to describe the volatility smile when it is nonsymmetrical. 

Volatility Smile The variation of implied volatility with strike price. 

Volatility Surface A table showing the variation of implied volatilities with strike price 
and time to maturity. 

Volatility Term Structure The variation of implied volatility with time to maturity. 

Volcker Rule Part of the Dodd-Frank Act in the United States that prohibits proprietary 
trading. 

Waterfall Rules for determining how principal and interest cash flows from an under- 
lying portfolio are distributed to tranches. In a typical arrangement, interest cash flows 
are first used to pay the senior tranche its promised return. The cash flows (if any) 
that are left over are used to provide the next-most-senior tranche with its promised 
return, and so on. Principal cash flows are usually used first to repay the senior tranche, 
then the next-most-senior tranche, and so on. The equity tranche is the last to receive 
both interest and principal cash flows. 

Weather Derivative A derivative where the payoff depends on the weather. 

Weighted Sensitivity Terminology used for delta in the weighted sensitivity approach. 

Whole Life Insurance Life insurance that lasts for the whole life of the insured and so 
is certain to provide a payout. 

Wholesale Banking Taking large deposits and making large loans. 

Within-Model Hedging Hedging only against those variables that the model considers 
as stochastic (cf. Outside-Model Hedging). 
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Worst Case Default Rate (WCDR) The default rate that will not be exceeded at 
some confidence level. 

Writing an Option Selling an option. 

Wrong-Way Risk A situation where a counterparty becomes more likely to default as 
the dealer’s exposure to the counterparty increases. 

Yield A return provided by an instrument. 

Yield Curve See Term Structure of Interest Rates. 

Zero-Coupon Bond A bond that provides no coupons. 

Zero-Coupon Interest Rate The interest rate that would be earned on a bond that 
provides no coupons. 

Zero-Coupon Yield Curve A plot of the zero-coupon interest rate against time to 
maturity. 

Zero Curve See Zero-Coupon Yield Curve. 

Zero Rate See Zero-Coupon Interest Rate. 

Z-Score A number indicating how likely a company is to default. 


RMEFI Software 


from the author’s website: www-2.rotman.utoronto.ca/~hull/software. 


T he RMFI software version 1.00 that accompanies this book can be downloaded 


Getting Started 


The most difficult part of using software is getting started. Here is a step-by-step guide 


to valuing an option using the RMFI software. 


1. 


Open the Excel file RMFI Software v1.00 that you have downloaded. 


2. If you are using Office 2007, click on Options at the top of your screen (above the 


F column) and then click Enable content. If you are not using Office 2007, make sure 
that your security for Macros is set at medium or low. (You can do this by clicking 
Tools, followed by Macros, followed by Security.) 


. Click on the Equity _FX_Indx_Fut_Opts-Calc worksheet tab at the bottom of the page. 
. Choose Currency as the Underlying Type and Binomial: American as the Option Type. 


Click on the Put button. Leave Imply Volatility unchecked. 


. You are now all set to value an American put option on a currency. There are seven 


inputs: exchange rate, volatility, risk-free rate (domestic), foreign risk-free rate, time 
to expiration (yrs), exercise price, and time steps. Input these in cells D5, D6, D7, D8, 
D13, D14, and D15 as 1.61, 12%, 8%, 9%, 1.0, 1.60, and 4, respectively. 


. Hit Enter on your keyboard and click on Calculate. You will see the price of the 


option in cell D20 as 0.07099 and the Greek letters in cells D21 to D25. 


. Click on Display Tree. You will see the binomial tree used to calculate the option. 
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Next Steps 


You should now have no difficulty valuing other types of options on other underly- 
ings with this worksheet. To imply a volatility, check the Imply Volatility box and input 
the option price in cell D20. Hit Enter and click on Calculate. The implied volatility is 
displayed in cell D6. 

Many different charts can be displayed. To display a chart, you must first choose the 
variable you require on the vertical axis, the variable you require on the horizontal axis, 
and the range of values to be considered on the horizontal axis. Following that, you 
should hit Enter on your keyboard and click on Draw Graph. 

Other points to note about this worksheet are: 


1. For European and American equity options, up to 10 dividends on the underlying 
stock can be input in a table that pops up. Enter the time of each dividend (measured 
in years from today) in the first column and the amount of the dividend in the second 
column. 

2. Up to 500 time steps can be used for the valuation of American options, but only a 
maximum of 10 time steps can be displayed. 

3. Greek letters for all options other than standard calls and puts are calculated by per- 
turbing the inputs, not by using analytic formulas. 

4. For an Asian option, the Current Average is the average price since inception. For a 
new deal (with zero time to inception) the current average is irrelevant. 

5. In the case of a lookback option, Minimum to Date is used when a call is valued and 
Maximum to Date is used when a put is valued. For a new deal these should be set 
equal to the current price of the underlying asset. 

6. Interest rates are continuously compounded. 


CDSs 


The CDS’s worksheet is used to calculate hazard rates from CDS spreads and vice versa. 
Users must input a term structure of interest rates (continuously compounded) and either 
a term structure of CDS spreads or a term structure of hazard rates. (All data for hazard 
rates, CDS spreads, and interest rates must be entered in chronological order.) The initial 
hazard rate applies from time zero to the time specified for the first hazard rate; the 
second hazard rate applies from the time corresponding to the first hazard rate to the 
time corresponding to the second hazard rate; and so on. The calculations are carried 
out assuming that default can occur only at points midway between payment dates. This 
corresponds to the calculations for the example in Appendix K. 


CDOs 


The CDO’s worksheet calculates quotes for the tranches of CDOs from tranche correla- 
tions input by the user (see Appendix L). The attachment points and detachment points 
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for tranches are input by the user. The quotes can be in basis points or involve an upfront 
payment. In the latter case, the spread in basis points is fixed and the upfront payment, 
as a percent of the tranche principal, is either input or implied. (For example, the fixed 
spread for the equity tranche of iTraxx Europe or CDX NA IG is 500 basis points.) The 
number of integration points defines the accuracy of calculations and can be left as 10 
for most purposes. (The maximum is 30.) Data for interest rates and hazard rates must 
be entered in chronological order. The software displays the expected loss as a percent of 
the tranche principal (ExpLoss) and the present value of expected payments (PVPmts) 
at the rate of 10,000 basis points per year. The spread is ExpLoss*10,000/PVPmts. The 
upfront payment is ExpLoss—(Spread*PVPmts/10,000). The worksheet can be used to 
imply either tranche (compound) correlations or base correlations from quotes input by 
the user. For base correlations to be calculated, it is necessary for the first attachment 
point to be 0% and the detachment point for one tranche to be the attachment point for 
the next tranche. 


Equity Option Functions 


The equity option functions worksheet provides functions for valuing a range of different 
options. The functions can be used in conjunction with the user’s own worksheets to 
allow the properties of the options to be investigated. 


Value at Risk 


The value at risk worksheet allows VaR to be calculated for a portfolio of options depen- 
dent on a single stock using the linear model, a model based on the first three moments, 
and an analytic approach where equally likely values for a portfolio are generated. The 
first two approaches are based on the material in Chapter 14. 


Definition of Greek Letters 


The Greek letters are defined as follows. 


Delta: Change in option price per dollar increase in underlying asset. 

Gamma: Change in delta per dollar increase in underlying asset. 

Rho: Change in option price per 1% increase in interest rate (e.g., interest increases from 
5% to 6%). 

Theta: Change in option price per calendar day passing. 

Vega: Change in option price per 1% increase in volatility (e.g., volatility increases from 
20% to 21%). 


Table for N(x) When x > 0 


his table shows values of N(x) for x > 0. The table should be used with inter- 
polation. For example, N(0.6278) = N(0.62) + 0.78[N (0.63) — N(0.62)| = 
0.7324 + 0.78 X (0.7357 — 0.7324) = 0.7350. 


x 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 


0.0 0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 0.5279 0.5319 0.5359 
0.1 0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 0.5675 0.5714 0.5753 
0.2 0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 0.6064 0.6103 0.6141 
0.3 0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 0.6443 0.6480 0.6517 
0.4 0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 0.6808 0.6844 0.6879 


0.5 0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 0.7157 0.7190 0.7224 
0.6 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.7549 
0.7 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.7852 
0.8 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.8133 
0.9 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.8389 


1.0 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.8621 
1.1 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.8830 
1.2 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.9015 
1.3 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.9177 
1.4 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.9319 


1.5 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.9441 
1.6 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.9545 
1.7 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.9633 
1.8 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.9706 
1.9 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.9767 
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TABLE FOR N(X) 


WHEN 


X20 


0.00 


0.01 


0.02 


0.03 


0.04 


0.05 


0.06 


0.07 


0.08 


0.09 


0.9772 
0.9821 
0.9861 
0.9893 
0.9918 


0.9938 
0.9953 
0.9965 
0.9974 
0.9981 


0.9986 
0.9990 
0.9993 
0.9995 
0.9997 


0.9998 
0.9998 
0.9999 
0.9999 
1.0000 
1.0000 


0.9778 
0.9826 
0.9864 
0.9896 
0.9920 


0.9940 
0.9955 
0.9966 
0.9975 
0.9982 


0.9987 
0.9991 
0.9993 
0.9995 
0.9997 


0.9998 
0.9998 
0.9999 
0.9999 
1.0000 
1.0000 


0.9783 
0.9830 
0.9868 
0.9898 
0.9922 


0.9941 
0.9956 
0.9967 
0.9976 
0.9982 


0.9987 
0.9991 
0.9994 
0.9995 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9788 
0.9834 
0.9871 
0.9901 
0.9925 


0.9943 
0.9957 
0.9968 
0.9977 
0.9983 


0.9988 
0.9991 
0.9994 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9793 
0.9838 
0.9875 
0.9904 
0.9927 


0.9945 
0.9959 
0.9969 
0.9977 
0.9984 


0.9988 
0.9992 
0.9994 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9798 
0.9842 
0.9878 
0.9906 
0.9929 


0.9946 
0.9960 
0.9970 
0.9978 
0.9984 


0.9989 
0.9992 
0.9994 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9803 
0.9846 
0.9881 
0.9909 
0.9931 


0.9948 
0.9961 
0.9971 
0.9979 
0.9985 


0.9989 
0.9992 
0.9994 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9808 
0.9850 
0.9884 
0.9911 
0.9932 


0.9949 
0.9962 
0.9972 
0.9979 
0.9985 


0.9989 
0.9992 
0.9995 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9812 
0.9854 
0.9887 
0.9913 
0.9934 


0.9951 
0.9963 
0.9973 
0.9980 
0.9986 


0.9990 
0.9993 
0.9995 
0.9996 
0.9997 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


0.9817 
0.9857 
0.9890 
0.9916 
0.9936 


0.9952 
0.9964 
0.9974 
0.9981 
0.9986 


0.9990 
0.9993 
0.9995 
0.9997 
0.9998 


0.9998 
0.9999 
0.9999 
0.9999 
1.0000 
1.0000 


—0.7 
—0.8 


—1.0 


Table for N(x) When x < 0 


his table shows values of N(x) for x <0. The table should be used 
with interpolation. For example, N(—0.1234) = N(—0.12) — 0.34[N(—0.12) — 
N(—0.13)] = 0.4522 — 0.34 x (0.4522 — 0.4483) = 0.4509. 


0.00 


0.5000 
0.4602 
0.4207 
0.3821 
0.3446 


0.3085 
0.2743 
0.2420 
0.2119 
0.1841 


0.1587 
0.1357 
0.1151 
0.0968 
0.0808 


0.0668 
0.0548 
0.0446 
0.0359 
0.0287 


0.01 


0.4960 
0.4562 
0.4168 
0.3783 
0.3409 


0.3050 
0.2709 
0.2389 
0.2090 
0.1814 


0.1562 
0.1335 
0.1131 
0.0951 
0.0793 


0.0655 
0.0537 
0.0436 
0.0351 
0.0281 


0.02 


0.4920 
0.4522 
0.4129 
0.3745 
0.3372 


0.3015 
0.2676 
0.2358 
0.2061 
0.1788 


0.1539 
0.1314 
0.1112 
0.0934 
0.0778 


0.0643 
0.0526 
0.0427 
0.0344 
0.0274 


0.03 


0.4880 
0.4483 
0.4090 
0.3707 
0.3336 


0.2981 
0.2643 
0.2327 
0.2033 
0.1762 


0.1515 
0.1292 
0.1093 
0.0918 
0.0764 


0.0630 
0.0516 
0.0418 
0.0336 
0.0268 


0.04 


0.4840 
0.4443 
0.4052 
0.3669 
0.3300 


0.2946 
0.2611 
0.2296 
0.2005 
0.1736 


0.1492 
0.1271 
0.1075 
0.0901 
0.0749 


0.0618 
0.0505 
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0.0000 
0.0000 


0.0188 
0.0146 
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